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Abstract 

We present optimal online algorithms for two related known problems involving Steiner 
Arborescence, improving both the lower and the upper bounds. One of them is the well studied 
continuous problem of the Rectilinear Steiner Arborescence (RSA). We improve the lower 
bound and the upper bound on the competitive ratio for RSA from O(logiV) and 0(Vlog N) 
to 0( A ); where N is the number of Steiner points. This separates the competitive ratios 
of RSA and the Symetric-RSA (SRSA), two problems for which the bounds of Berman and 
Coulston is STOC 1997 were identical. The second problem is one of the Multimedia Content 
Distribution problems presented by Papadimitriou et al. in several papers and Charikar et al. 

SODA 1998. It can be viewed as the discrete counterparts (or a network counterpart) of RSA. 

For this second problem we present tight bounds also in terms of the network size, in addition 
to presenting tight bounds in terms of the number of Steiner points (the latter are similar to 
those we derived for RSA). 

Keywords: Online Algorithm, Approximation Algorithm, Video-on-Demand 

1 Introduction 

Steiner trees, in general, have many applications, see e.g. |12] for a rather early survey that already 
included hundreds of items. In particular, Steiner Arborescence^ are useful for describing the 
evolution of processes in time. Intuitively, directed edges represent the passing of time. Since there 
is no way to go back in time in such processes, all the directed edges are directed away from the 
initial state of the problem (the root), resulting in an arborescence. Various examples are given 
in the literature such as processes in constructing a Very Large Scale Integrated electronic circuits 
(VLSI), optimization problems computed in iterations (where it was not feasible to return to results 
of earlier iterations), dynamic programming, and problems involving DNA, see, e.g. [H [6l IT3| [3]. 
Papadimitriou at al. [igiis] and Charikar et al. [5] presented the discrete version, in the context 
of Multimedia Content Delivery (MCD) to model locating and moving caches for titles on a path 
graph. The formal definition of (one of the known versions ) of this problem, Directed-MCD, 
appears in Section [2l 
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0937274 and AFOSR FA9550-13-1-0042. 
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the ISF, Israeli ministry of science and by the Technion Gordon Center. 

^A Steiner arborescence is a Steiner tree directed away from the root. 
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We present new tight lower and upper bounds for two known interrelated problems involving 
Steiner Arborescences: Rectilinear Steiner Arborescence (dlSAJ and Directed-MCD (DMCD). 
We also deal indirectly with a third known arborescence problem: the Symmetric-USA (SRSA) 
problem by separating its competitive ratio from that of RSA. That is, when the competitive 
ratios of RSA and SRSA were discussed originally by Berman and Coulston [3], the same lower 
and upper bounds were presented for both problems. 

The RSA problem: This is a rather heavily studied problem, described also e.g. in [MllIIlSl 
EZllg]. A rectilinear line segment in the plane is either horizontal or vertical. A rectilinear path 
contains only rectilinear line segments. This path is also y-monotone (respectively, x-monotone) if 
during the traversal, the y (resp., x) coordinates of the successive points are never decreasing. The 
input is a set of requests TZ = {n = {xi,yi), ...,rN = ixN,yN)} called Steiner terminals (or points) 
in the positive quadrant of the plane. A feasible solution to the problem is a set of rectilinear 
segments connecting all the N terminals to the origin tq = (0,0), where the path from the origin to 
each terminal is both x-monotone and y-monotone (rectilinear shortest path). The goal is to find 
a feasible solution in which the sum of lengths of all the segments is the minimum possible. The 
above mentioned third problem, SRSA was defined in the same way, except that the above paths 
were not required to be x-monotone (only y-monotone). 

Directed-MCD defined in Section [2] is very related to RSA. Informally, one difference is that it 
is discrete (Steiner points arrive only at discrete points) whiling RSA is continuous. In addition, 
in DMCD each “X coordinates” represents a network nodes. Hence, the number of X coordinates 
is bounded from above by the network size. This resemblance turned out to be very useful for us, 
both for solving RSA and for solving DMCD. 

The online version of RSA |i4j: the given requests (terminals) are presented to the algorithm 
with nondecreasing y-coordinates. After receiving the i’th request Xj = (xj,yi) (for i = 
the on-line RSA algorithm must extend the existing arborescence solution to incorporate r*. There 
are two additional constraints: (1) a line, once drawn (added to the solution), cannot be deleted, 
and (2) a segment added when handling a request Xj, can only be drawn in the region between 
yi-i (the y-coordinates of the previous request Xi_i) and upwards (grater y-coordinates). If an 
algorithm obeys constraint (1) but not constraint (2), then we term it a pseudo online algorithm. 
Note that quite a few algorithms known as “online”, or as “greedy offline” fit this definition of 
“pseudo online”. 

Additional Related works. Online algorithms for RSA and SRSA were presented by Berman 
and Coulston [3]. The online algorithms in [3] were O(logX) competitive (where N was the number 
of the Steiner points) both for RSA and SRSA. Berman and Coulston also presented D(\/log N) 
lower bounds for both continuous problems. Note that the upper bounds for both problems were 
equal, and were the squares of the lower bounds. A similar gap for MCD arose from results of 
Halperin, Latombe, and Motwani who gave a similar competitive ratio of O(logX), while 
Charikar, Halperin, and Motwani [5] presented a lower bound of H(\/log n) for various variants of 
MCD, where n was the size of the network. Their upper bound was again the square of the lower 
bound: 0(min{log n, log A}) (translating their parameter p to the parameter n we use). 

Berman and Coulston also conjectured that to close these gaps, both the upper bound and the 
lower bound for both problems could be improved. This conjecture was disproved in the cases of 
SRSA and of MCD on undirected line networks m- The latter paper closed the gap by presenting 
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an optimal competitive ratio of 0{y/TogW) for SRSA and 0(min{-y^, yloglV}) for MCD on the 
undirected line network with n nodes. They left the conjecture of Berman and Coulston open 
for RSA and for MCD on directed line networks. In the current paper, we prove this conjecture 
(for RSA and for Directed-MCD), thus separating RSA and SRSA in terms of their competitive 
ratios. 

Charikar, Halperin, and Motwani [5] also studied the the offline case for MCD, for which they 
gave a constant approximation. The offline version of RSA is heavily studied. It was attributed to 
m who gave an exponential integer programming solution and to [9] who gave an exponential time 
dynamic programming algorithm. An exact and polynomial algorithm was proposed in [23], which 
seemed surprising, since many Steiner problems are NP Hard. Indeed, difficulties in that solution 
were noted by Rao, Sadayappan, Hwang, and Shor [21], who also presented an approximation 
algorithm. Efficient algorithms are claimed in [7] for VLSI applications. However, the problem was 
proven NP-Hard in [22]. (The rectilinear Steiner tree problem was proven NPH in [lO]). Heuristics 
that are fast “in practice” were presented in [8]. A PTAS was presented by [l6]. An optimal 
logarithmic competitive ratio for MCD on general undirected networks was presented in [2]. They 
also present a constant off-line approximation for MCD on grid networks. 

On the relation between this paper and An additional contribution of the current 

paper is the further development of the approach of developing (fully) online algorithms in two 
stages: (a) develop a pseudo online algorithm; and (b) convert the pseudo online into an online 
algorithm. As opposed to the problem studied in m where a pseudo online algorithm was known, 
here the main technical difficulty was to develop such an algorithm. From m we also borrowed 
an interesting twist on the rather common idea to translate between instances of a discrete and a 
continuous problems: we translate in both directions, the discrete solutions helps in optimizing the 
continuous one and vice versa. 

Our Contributions. We improve both the upper and the lower bounds of RSA to show that 
the competitive ratio is This proves the conjecture for RSA of Berman and Coulston 

[3] and also separates the competitive ratios of RSA and SRSA. We also provide tight upper and 
lower bound for Directed-MCD, the network version of RSA (both in terms of n and of N). The 
main technical innovation is the specific pseudo online algorithm we developed here, in order to 
convert it later to an online algorithm. The previously known offline algorithms for RSA and for 
DMCD where not pseudo online, so we could not use them. In addition to the usefulness of the 
new algorithm in generating the online algorithm, this pseudo online algorithm may be interesting 
in itself: It is 0(l)-competitive for DMCD and for RSA (via the transformation) for a different 
(but rather common) online model (where each request must be served before the next one arrives, 
but no time passes between requests). 

Paper Structure. Definitions are given in Section [2] The pseudo online algorithm Square 
for DMCD is presented and analyzed in Section [S] In Section 0] we transform Square to a 
(fully) online algorithm D-Line™ for DMCD. Then, Section O describes the transformation of 
the online DMCD algorithm D-Line°^ to become an optimal online algorithm for RSA, as well 
as a transformation back from RSA to DMCD to make the DMCD online algorithm also optimal 
in terms of n (not just N). These last two transformations are taken from [15]. Finally, a lower 
bound is given in Section [6] The best way to understand the algorithms in this paper may be from 
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a geometric point of view. Hence, we added multiple drawings to illustrate both the algorithms 
and the proofs. 

2 Preliminaries 

The network X time grid (Papadimitriou et. al, [IH]). A directed line network L{n) = [Vn^En) 
is a network whose node set is = {1,..., n} and its edge set is = {(i, i + 1) | i = 1,..., n — 1}. 
Given a directed line network L{n) = {Vn,En), construct ’’time-line” graph C{n) = {Vn,Sn), 
intuitively, by “layering” multiple replicas of L{n), one per time unit, where in addition, each 
node in each replica is connected to the same node in the next replica (see Fig. [1]). Formally, 
the node set Vn contains a node replica (sometimes called just a replica) {v,t) of every v € 14., 
coresponding to each time step t G N. That is, Vn = {{v,t) \ v G Vn,t G N}. The set of directed 
edges £n = Tdn U An contains horizontal directed edges Tin = {{{u,t), {v,t)) \ iu,v) G En,t G N}, 
connecting network nodes in every time step (round), and directed vertical edges, called arcs, 
An = {((u,t), {v,t -|- 1)) I u G Vn,t G N}, connecting different copies of Vn- When n is clear from 
the context, we may write just X rather than Xn, for every X G {V,E,V,'H,A}. Notice that C{n) 
can be viewed geometrically as a grid of n by oo whose grid points are the replicas. Following Fig. 
[U we consider the time as if it proceeds upward. We use such geometric presentations also in the 
text, to help clarifying the description. 
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Figure 1: An example of a time-line graph C{n) = {V,£ = Td U^). Each node in V is represented 
by a circle; each horizontal edge in Td is represented by a horizontal segment (see, as an example, 
{{u,2), {w,2)) G Id for an horizontal directed edge in the marked rectangle on the right); each arc in A 
is represented by a horizontal arrow (see, as an example, ((u, 3), (u, 4)) G A for an arc in the marked 
rectangle on the left). 


The DMCD problem. We are given a directed line network L{n), an origin node vq G V, and 
a set of requests 7^ G V. A feasible solution is a subset of directed edges E V £ such that for every 
request r G 7^, there exists a path in E from the origin (uq, 0) to r. Intuitively a directed horizontal 
edge {{u,t), {v,t)) is for delivering a copy of a multimedia title from node u to node v at time t. 

A directed vertical edge (arc) ((u,t), {v,t + 1)) is for storing a copy of the title at node v from 
time t to time t -|- 1. For convenience, the endpoints Vj of edges in E are also considered parts of 
the solution. For a given algorithm A, let Ea be the solution of A, and let cost{A,TZ), (the cost of 
algorithm A), be \Ea\- (We assume that each storage cost and each delivery cost is 1.) The goal 
is to find a minimum cost feasible solution. Let opt be the set of edges in some optimal solution 
whose cost is IoptI. 
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Online DMCD. In the online versions of the problem, the algorithm receives as input a sequence 
of events. One type of events is a request in the (ordered) set TZ of requests TZ = {ri,r 2 , ...,rAr}, 
where the requests times are in a non-decreasing order, i.e., ti < t 2 < (^-s in RSA). A 

second type of events is a time event (this event does not exists in RSA), where we assume a clock 
that tells the algorithm that no additional requests for time t are about to arrive (or that there are 
no requests for some time t at all). The algorithm then still has the opportunity to complete its 
calculation for time t (e.g., add arcs from some replica (v, t) to (v, t + 1)). Then time t + 1 arrives. 

When handling an event ev, the algorithm only knows the following: (a) all the previous requests 
ri,..., r*; (b) time t; and (c) the solution arborescence Tev it constructed so far (originally containing 
only the origin). In each event, the algorithm may need to make decisions of two types, before 
seeing future events: 

(l.DMCD) If the event is the arrival of a request r* = (uj,tj), then from which current (time ti) cache (a 
point already in the solution arborescence J^ev when r* arrives) to serve r* by adding horizontal 
directed edges to J^ev 

(2.DMCD) If this is the time event for time t, then at which nodes to store a copy for time t + 1, for 
future use: select some replica (or replicas) (u, t) already in the solution and add to Fev 
an edge directed from {v,t) to {v,t + l). 

Note that at time t, the online algorithm cannot add nor delete any edge with an endpoint that 
corresponds to previous times. Similarly to e.g. [2lll8l|20l[l9l[5], at least one copy must remain in 
the network at all times. 

General definitions and notations. Consider an interval J = {v,v -|- l,...,u + p} V and 
two integers E N, s.t. s < t. Let J[s,t] (see Fig. [2]) be the “rectangle subgraph” of C{n) 
corresponding to vertex set J and time interval [s,t]. This rectangle consists of the replicas and 
edges of the nodes of J corresponding to every time in the interval [s, t]. For a given subsets V' C V, 
hi' C hi and A' C A, denote by (1) replicas of V' corresponding to times s,...,t- Define 

similarly (2) R'[s, t] for horizontal edges of hi'] and (3) A'[s, t] arcs of A'. (When s = t,we may write 
A’[t] = A[s, t], for A E {J, V', ^'}.) Consider also two nodes u, u E F s.t. u<v. LetV-^[{u,t),{v,t)] 
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Figure 2: A subgraph rectangle J[s,t], where J = {2,3,4, 5,6}. 

be the set of horizontal directed edges of the path from {u,t) to {v,t)- Let Ry^[(u,s), (u,t)] be the 
set of arcs of the path from {v,s) to {v,t)- Let dist'^{{u, s), {v,t)) be the “directed” distance from 
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(n, s) to {v,t) in Loo norm. Formally, dist'^{{u, s), {v,t)) = max{t — s,v — u}, if s < t and u < v 
and dist^{{u, s), iv,t)) = oo, otherwise. 

3 Algorithm SQUARE, a pseudo online algorithm 

This section describes a pseudo online algorithm named Square for the DMCD problem. Devel¬ 
oping Square was the main technical difficulty of this paper. Consider a requests set TZ = {tq = 
(0,0), ri = (ui, ti), ..., rw = {vN,tN)} such that 0 < U < Q < When Algorithm Square 

starts, the solution includes just ro = (0,0). Then, Square handles, first, request ri, then, request 
r2, etc... In handling a request rj, the algorithm may add some edges to the solution. (It never 
deletes any edge from the solution.) After handling ri, the solution is an arborescence rooted at 
ro that spans the request replicas ri,...,rj. Denote by SQUARE(f) the solution of Square after 
handling the z’th request. For a given replica r = (v,t) £ V and a positive integer p, let 

S[r,p] = [v - p,v] X [t - p,t] 

denotes the rectangle subgraph (of the layered graph) whose top right corner is r induced by the 
set of replicas that contains every replica q such that ( 1 ) there is a directed path in the layer graph 
from q to r; and (2) the distance from (7 to r in Loo is at most p. For each request rj G TZ, for 
i = 1,...,N, Square performs the following (The pseudo code of Square is given in Fig. |4] and 
an example of an execution in Fig. [3|). 



-•u 

close 





TAIL(i) 


„close 


serve 


(b) 


Figure 3: Square execution example, (a) in case (SQ3), SQUARE(f — 1) (Square’s solution after 
handling point ri_i); defines the radius p^^{i). (b) the serving replica is the leftmost in 

S[ri, 5p^^{i)] n Square(z - I). (c) Square(z). 


(SQl) Add the vertical path from (0,ti_i) to (0,tj). 

(SQ2) Let replica gplose _ ^-^close^ gClose^ is already in the solution Square(z — 1) 

and (1) the distance in Loo norm from to r* is minimum (over the replicas already 

in the solution); and (2) over those replicas choose the latest, that is, = max{t < 

ti I (u9l°®®,t) G SQUARE(f — 1)}. Define the radius of r^ as p^^{i) = distT^{q'p^‘^^^,ri) = 
max{|uj — \ti — s^losej ^close closest replica of the f’th request. 
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(SQ3) Choose a replica g^^rve _ f^^serve^ gSeive^ ^ 5[rj,5 • p^^{i)] such that is already in 

the solution SQUARE(i — 1) and is the leftmost node (over the nodes corresponding to 

replicas of 5[rj,5 • that are already in the solution). Call the serving replica of 

the z’th request. 

(SQ4) Deliver a copy from to r* via ti). This is done by storing a copy in node 

from time to time U, and then delivering a copy from fj) to {vi,ti) H. 

(SQ5) Store a copy in n?®!"^® from time U to time fj + 4 • p^^{i) E|. 


Intuitively, steps SQ1~SQ4 utilize previous replicas in the solution, while step SQ5 prepares the 
contribution of r* to serve later requests. Note that Square is not an online algorithm, since in step 
SQ4, it may add to the solution some arcs corresponding to previous times. Such an action cannot be 
preformed by an online algorithm. Denote by the feasible solution Square(N') 

of Square. Let BASE(i) = {{u,ti) \ rt?®'"^® < u < Vi} and let Base = (notice that 

Base C because of step SQ4). Similarly, let tail(z) = {(r®®’^'^®, f) \ ti < t < ti + 4p^Q(i)} be 
the nodes of the path P.a[(r®®’^'^®, tj)) + 4 • /O^^(O)] (added to the solution in step SQ5) 

and let tail = u(^rTAiL(z). Note that is indeed an arborescence rooted at (0,0). 


• ■<— {(0,0)} is the Square’s solution after handling request Vi-i. 

• When request rj arrives do: 

1 . ^T^Q[jVAmti_i),io,ti)]. 

2’. pSQ(^) ^ in.m{dist^{q,ri) \ q G 

2. Choose a replica _ ^^close^ gClose^ such that gflose jg SQUARE(i — 1) and 

(a) disf)(^(g®^®®®,Tj) = p^Q(z); and 

(b) s?l®®® = maxjt < ti \ (tt®^®®®,f) G SQUARE(f — 1)}. 


3. Choose the serving replica f?®®’^^® = (r®®’^’^®, s®®’^’^®) G 5[rj,5 • p^^(i)] n such that 
n®erve = min |r | 3s such that {u, s) G S[ri, 5 • p^^{i)] C 

[> is the leftmost node corresponding to the replicas of 5[ri,5 • p^^{i)] C 

4. ^SQ ^ ^SQ u P^[(nr^®, s™), (n^^®, t,)] U t^), {v,,U)]- 

> deliver a copy from g'®®’^^® to r* via (r®®’^^®, tj). 

5. ^ T^^UVA[iuf^^^,U), (R™,fi +4pSQ(i))]. 

[> leave a copy at tip^"^® from current time ti to time U + 4p^^{i). 


Figure 4: Algorithm Square. 


■^More formally, add the arcs of Pyi[(u®®'^^®, sf®'^'^®), (uf®'^^®, ti)] and the horizontal directed edges of 
P-H[(rtf®^^®, ti), {vi,ti)] to the solution. 

® More formally, add the arcs of P^[(uf®'^^®, ti), (uf®^^®,ti + 4 • p^^(i))] to the solution. 
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3.1 Analysis of Square 

First, bound the cost of Square as a function of the radii (defined in SQ2). 

Observation 3.1 cost(SQUARE,7^) < llXlili 

Proof: For each request r* E TZ, Algorithm SQUARE pay a cost of to the path between 

Tj’s serving replica a r* itself (see step SQ4) and additional cost of 4/3^Q(i) for serving a copy 

to all replicas of TAiL(i) (see step SQ5). | 

It is left to bound from below the cost of the optimal solution as a function of the radii. 

Quarter balls. Our analysis is based on the following notion. A quarter-ball^ or a Q-ball, of 
radius p E N centered at a replica g = (u, t) E V contains every replica from which there exists 
a path of length p to g 0 . For every request E 7^, denote by (5-BALL®^(rj, p^Q(i)) 0 (also 
(5-BALL®^(i) for short) the quarter-ball centered at with radius p^Q(i). 

Intuitively, for every request E TZ' (where TZ' obey the observation’s condition below), opt’s 
solution starts outside of (5-BALL®‘5(i), and must reach with a cost of p^Q(i) at least. 

Observation 3.2 Consider some subset TZ' ^ TZ of requests. If the Q-balls, (5-BALL®Q(i) and 
Q-BALL®^(j), of every two requests ri,rj E TZ' are edges disjoint, then |opt| > Ylnen' 

Intuitively, for every request r* E TZ' (where TZ' obey the observation’s condition), opt’s solution 
starts outside of Q-BALL®*^(i), and must reach with a cost of fP^{i) at least. 

Proof: Consider some request E 7^'. Any directed path from (uq, 0) to r* must enter the quarter 
ball Q-BALL®^(i) of radius p^Q(7) to reach rj. The length of this path inside the Q-BALL®^(i) is 
p^Q(i). All the Q-balls of TZ' are disjoint, which implies the observation. | 

3.1.1 Covered and uncovered requests 

Consider some request r* = {vi,ti) and its serving replica = (ttp’^^®, sp’^^®) (see step SQ3). 

We say that r* is covered, if Vi — > p^Q(i) (see SQ2 and SQ3). Intuitively, this means the 

solution is augmented by the whole top of the square SQUARE[ri, p^^(i)]; see Figure [3] (a) 
and (b). Otherwise, we say that r* is uncovered. Let cover = {i | r* is a covered request} and let 
UNCOVER = {i \ ri is an uncovered request}. Given Observation 13.21 the following lemma implies 
that 


|opt| > ^ P^^(*)- (1) 

ieCOVER 

Lemma 3.3 Consider two covered requests r* and rj. 

Then, the quarter balls Q-ball®*^}^) and Q-ball®‘^(j) are edge disjoint. 

Proof: Assume without loss of generality that, i > j. Thus, 7^^[(tt®®’^^®, tj), {vj,tj)] (see SQ4) is 
already in the solution when handling request i. Also, p^^{j) > Vj — n®®^^®, since rj is covered. 
Consider three cases. 

^ This is, actually, the definition of the geometric place “ball”. We term them “quarter ball” to emphasize that 
we deal with directed edges. That is, it is not possible to reach {v,t) from above nor from the right. 

® Note that Q-BALL®®^(ri, is different from 5[ri, p^Q(i)], since the first ball considers distances in L 2 norm 

and the last considers distances in Loo norm. 



Case 1. Vj < Vi, see Figure El Since, r* is covered, < dist'^{rj,ri) = max{uj — Vj , ti — tj}. 

If Vj < Vi — then these two Q-balls are edges disjoint. Otherwise, Vi — vj < p^^{i). 

Then p ^^{ i ) < ti — tj which implies that these two Q-balls are edges disjoint. 

Case 2. Vj — p^^{j) < Vi < Vj, see Figure m Then, also Vi > since Vj is covered. Thus, in 

particular, {vi,tj) G Square(z — 1). Hence, p^^{i) < dist'^{{vi,tj),ri = {vi,ti)) = ti — tj, 
which implies that these two Q-balls are edges disjoint. 

Case 3. Vi < Vj — p^^{j)- The Q-ball of Vj is on the right of any possible (radius) Q-ball with as a 
center. Thus, these Q-balls are edges disjoint. 

I 


v'j < 


(3-BALL^Q(i) 

Q-BALL^Qjj^ 

. -fri 


Q-BALL^Q(j) 


\i 

; ; _close 

Q-BAIjL^Q(j) 

1-^^-1- 


—-'‘---ti - > tj 


„clbse 


(b) 


Figure 5: Two covered requests are edge disjoint, case 1; (a) Q-BALL®*3(i) is on the right of 
(5-BALL®Q(j), since Vj < u* — p^^{i)] (b) Vi — p^^{i) < Vj < Vi implying that the whole Q-BALL®Q(i) 
is above Q-BALL®^(j). 


•r,' 


ti - tj > pSQ(i) 


{Vi,tj) 


< Vi < Vj 


Figure 6: Two covered requests are edge disjoint, case 2. 

By the above lemma and observations 13.1113.21 and Inequality ([T]), we have: 
Observation 3.4 Square’s cost for covered requests is no more than 14 • opt. 

It is left to bound the cost of Square for the uncovered requests. 
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3.1.2 Overview of the analysis of the cost of uncovered requests 

Unfortunately, unlike the case of covered requests, balls of two uncovered requests may not be 
disjoint. Still, we managed to have a somewhat similar argument that we now sketch. (The formal 
analysis appears later in Subsection 13.2[ i Below, we partition the balls of uncovered requests into 
disjoint subsets. Each has a representative request, a root. We show that the Q-ball of roots are 
edge disjoint. This implies by Observation 13.11 and Observation 13.21 that the cost Square pays for 
the roots is smaller than 14 times the total cost of an optimal solution. Finally, we show that the 
cost of Square for all the requests in each subset is at most twice the cost of Square for the root 
of the subset. Hence, the total cost of Square for the uncovered requests is also just a constant 
times the total cost of the optimum. 

To construct the above partition, we define the following relation: ball Q-ball®^(j) is the child 
of Q-BALL®^(i) (for two uncovered requests r* and rj) intuitively, if the Q-BALL®*3(i) is the first ball 
(of a request later then rj) such that Q-BALL®^(i) and (5-BALL®^(j) are not edge disjoint. Clearly, 
this parent-child relation induces a forest on the Q-balls of uncovered requests. The following 
observation follows immediately from the definition of a root. 

Observation 3.5 The quarter halls of every two root requests are edge disjoint. 

Proof: Consider two root requests r* and rj. Assume W.O.L.G that j < i. Also assume, toward 
contradiction that (5 -ball®^(z) and (5 -ball®^(j) are not edge disjoint. By the definition of the 
child parent relationship, either rj is child of r,, or rj is a child of some other request r^ for some 
j < i < i. In both cases, rj has a parent, hence rj is not a root request which contradict to choice 
of Vj as a root request. The observation follow. | 

The above observation together with Observation 13.21 implies the following. 

Observation 3.6 The cost o/ Square for the roots is 14 • |opt| at most. 

It is left to bound the cost that Square pays for the balls in each tree (in the forest of Q-balls) 
as a constant function of the cost it pays for the tree root. Specifically, we show that the sum of 
the radii of the Q-balls in the tree (including that of the root) is at most twice the radius of the 
root. This implies the claim for the costs by Observation 13.11 and Observation 13.21 To show that, 
given any non leaf ball (5-BALL®^(i) (not just a root), we first analyze only (5 -ball®^(z)’s “latest 
child” (5 -ball®^(j). That is, j = maxfc{(5-BALL®^(A:) is a child of Q-BALL®^(f)}. We show that 
the radius of the latest child is, at most, a quarter of the radius of (5-BALL®^(f). (See Property 
(PI) of Lemma [3.14l in Subsection [321) Second, we show that the sum of the radii of the rest of the 
children (all but the latest child) is, at most, a quarter of the radius of Q-BALL®^(f) too. Hence, 
the radius of a parent ball is at least twice as the sum of its children radii. This implies that the 
sum of the radii of all the Q- balls in a tree is at most twice the radius of the root. 

The hardest technical part here is in the following lemma that, intuitively, states that “a lot of 
time” (proportional to the request’s radius) passes between the time one child ball ends and the 
time the next child ball starts, see Fig. [71 

Lemma 3.7 Consider some uncovered request ri which has at least two children. Let Q-ball®^(j), 
(5-BALL®^(fc) some two children o/Q-ball®*^(z), such thatk < j. Then, tj—p^^{j) > tk+4:p^^{k). 

Intuitively, the radius of a parent Q-ball is covered by the radii of its children Q-balls, plus 
the tails (see step SQ5) between them. Restating the lemma, the time of the earliest replica in 
Q-BALL^^{j) is not before the time of the latest replica in tail(/c). Intuitively, recall that the tail 
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pSQ(j) 


p^^{k) 


Figure 7: Geometric look on a parent (5-BALL®‘3(i) (note that a Q-ball is a triangle) and its children 
(5-BALL®^(j) and Q-BALL®^(fe). 

length of a request is much grater than the radius of the request’s Q-ball. Hence, the fact that 
the radius of a latest child is at most a quarter of the radius of its parent, together with Lemma 
13.71 imply that the sum of the childrens radii is less than half of the radius of the parent Q-ball. 

The full proof of Lemma ITTI (appears in Subsection 13.21) uses geometric considerations. Out¬ 
lining the proof, we first establish an additional lemma. Given any two requests rj and such 
that j > i, the following lemma formalizes the following: Suppose that the node vj of request rj is 
“close in space (or in the network)” to the node vg of another request r^. Then, the whole Q-ball 
of rj is “far in time” (and later) from rj. 

Lemma 3.8 Suppose that, j > £ and Vj — p^^{j) + 1 < < vj. Then, the time of the earliest 

replica in (5-ball®‘3(j) is not before the time of the latest replica in tail(.^), i.e., tj — > 

+ 4pSQ(£)_ 

Intuitively, Lemma 13.81 follows thanks to the tail left in step SQ5 of Square, as well as to the 
action taken in SQ3 for moving further left of In the proof of Lemma 13.71 we show 

that in the case that two requests r^ and rj are siblings, either (1) they satisfy the conditions of 
Lemma [T8l or (2) there exists some request such that k < I < j such that and rj satisfy the 
conditions of Lemma 13.81 Moreover, the time of the last replica in tail(.^) is even later then the 
time of the last replica in tail(/c). In both cases, we apply Lemma 13.81 to show that the time of 
the earliest replica in Q-ball®^(j) is not before the time of the latest replica in tail(A:) as needed 
for the lemma. 

To summarize, we show (I) For covered requests the cost of Square is 0(1) of |opt|; see 
Observation 13.41 (2) For uncovered requests, we prove two facts: (2.a) the Q-balls of the root 
requests are edges disjoint, and hence by Observation 13.61 the sum of their radii is 0(1) of |opt| 
too. (2.b) On the other hand, the sum of root’s radii is at least half of the sum of the radii of all 
the uncovered requests. This establishes Theorem 13.91 (which prove appears in Subsection 13.2p . 

Theorem 3.9 Algorithm Square is 0{l)-competitive for DMCD under the pseudo online model. 
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3.2 Formal analysis of the cost of uncovered requests 

We start with a formal definitions of the forest of parent-child relationships. 

Forest of balls. For any uncovered request r*, define the following notations. 

1. Let parent{i) = j be the minimal index grater than i such that (5 -ball®*^(z) and (5 -ball®*^(j) 
are not edges disjoint, if such exists, otherwise parent{i) =±. 

2. children(z) = {j I parent{j) = i}. 

3. tree(z) = UjgCHiLDREN(i) TREE(j), if children(z) / 0, otherwise TREE(i) = {i}. 

4. ROOTS = {i I parent{i) =-L}. 

(We also abuse the definition and say that request rj is child of request rp, and j is child of i, if 
(5-BALL®Q(j) is child of (5 -ball®Q(z).) 

We now, state four observations about uncovered requests. The main lemmas use these obser¬ 
vations heavily. Recall that, _ ^^close^ gClose^ closest replica of (see SQ2). 

Observation 3.10 The radius of an uncovered request is determined by the time difference from 
its closest repliea. That is, if a request ri is uncovered, then p^^(z) = ti — gdose^ 

Proof: If ri is uncovered, then Vi — < p^^{i). In addition, Vi — < Vi — since 

u®erve < ^close ^-g^g SQ2). Thus, Vi — < p^^{i) too. Therefore, p^^{i) = U — since 

p^^(i) = dist^{qf^^^^,ri) = max{ui — ^ggg gQ2). | 

Observation 3.11 The replicas of the “reetangle-graph” [uj ——1] x [t* —5p^Q(i), L] 
are not in SQUARE(i). 

Proof: Assume by the way of contradiction that some replica q = {w,t) G [vi — 5/0^^(i), — 

1] X [ti - 5p^Q(i),t i] is in SQUARE(i). This implies that Vi — 5p^Q(i) < w < contradicting 

the choice (in step SQ3) of node of the serving replica as the leftmost 

node over all replicas that are in the solution and in 5[rj, 5 • p^Q(i)]. | 

Observation 3.12 Consider some request ri. Assume that its closest replica is added to 

Square’s solution when handling request rj (for some j < i). Then, time of the 

i’th elosest replica is not before the time tj ofrj). 

Proof: The replica gf^°®® = (u?^°®®, s?^°®®) is added to the solution in step SQ4 or in step SQ5 
while Square is handling request rj. If gflose jg solution in step SQ4, then the replica 

(^close,^^.) jg g^g[(jgg[ ^]^g solution in that step too; otherwise, gf^°®® is added in step SQ5, and 
then a replica of at time t (for some time t > tj) is added to the solution. This implies that 

gClose > ggg g^gp gQ 2 fg], ^]^g selection of gf^°®®. | 

Observation 3.13 If there exists a repliea {w,ti) in the solution of Square(z — 1) such that 
0 < Vi — w < 5p^^{i), then r* is a covered request. 

Proof: By the definition, p^^{i) < distf^{{w,ti),ri), since the distance from w to Vi is a candidate 
for /3^Q(i). The observation now follows from the definition of a covered request. | 
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3.2.1 Parent ball in tree larger then its child 


As promised (in the overview), Property (PI) of Lemma 13.141 below implies that a parent ball in 
tree is at least four times larger than its “last child”. In fact, the lemma is more general (Property 
(P2) is used in the proof of other lemmas 

Lemma 3.14 Consider two uncovered requests and rj such that i > j. If Q-ball®^(z) and 
(5-BALL®^(j) are not edges disjoint, then the following properties hold. 

(PI) p^^{i) > 4 • and 

(P2) Vj - p^Q(j) < Vi < < Vj ( the leftmost replicas of Q-BALL^^{j) are on left of ri; r* is 

on the left of rj and even on the left of the j ’th serving replica). 

Proof: We first prove Property (P2). Since (5-BALL®^(f) and (5 -ball®^(j) are not edges disjoint, 

Vj — p^^{j) < Vi and Vi — p^^{i) < Vj . (2) 

Since also i > j (see Figure [S]), 

= ti-sf°^^ >ti-tj, (3) 

where the equality below follows from Observation 13.101 since rj is uncovered; and the inequality 
holds since, on the one hand, (5 -ball®^(z) and (5 -ball®^(j) are not edge disjoint, hence has a 
common edge; on the other hand, (1) for every edge in Q-BALL®^(i), at least one of its corre¬ 
sponding replicas corresponds to time strictly grater than U — p^^{i); however, non of the edges of 
(5-BALL®^(j) corresponding to replicas of time strictly grater than tj . 

The left inequality of Property (P2) holds by the left inequality of ([2]). The right inequality of 
Property (P2) holds trivially, see step SQ3. Assume by the way of contradiction that the remaining 
inequality does not holds, i.e., Vi > Consider two cases. 

Case 1. < Vi < Vj, see Figure [H Then, {vi,tj) is in SQUARE(j) (Square’s solution after 

handling request rj). Thus, p^^{i) < dist)^{{vi,tj),ri = {vi,ti)) = ti — tj, contradicting 
Inequality Q. 

Case 2. Vj < Vi, see Figure fTOl In this case, p^^{i) < dist)f,{rj,ri) = max{uj — Vj , tj — tj}. By the 
second inequality of ([2|), Uj — Vj < p^^{i). Hence, p^^{i) < ti — tj. Again, this contradicts 
Inequality (|3|). 

These two cases shows that Property (P2) holds. We next show that Property (PI) holds too. For 
that, consider two cases. 

Case A: < Vj — 5p^^{j). In other words, the closest replica _ ^.^close^ ^close^ jg 

the left of 5[rj, 5p^Q(j)] (see Figure El). Recall that the closest replica defines the 

radius p^^{i) (see SQ2), i.e., p^^{i) = max{uj — ri?^°®®,tj — s?^°®®}. We have (the second 
inequality bellow follows by substituting Uj using the first inequality of ([2])), 

p^^ii) >Vi- nfo®" > {vj - pSQ(j)) - {v^ - 5pSQ(j)) = 4pSQ(^.) 

® Actually, this lemma shows that property for any other child too, but for the other children this is not helpful, 
since there may be too many of them. 
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n 

...* 

\p-BALL^Q(i| 



pSQ(*) 

Vj- ‘p^^{j) < Vi 

Figure 8: Q-BALL®‘5(i) and Q-BALL®*^(j) are not edges disjoint implying inequalities ([2]) and (|3l) . 

as needed for Property (PI). (Intuitively, since Q-BALL®*^(i) intersect Q-BALLp^{j) on the 
left, Uj’s is at most p^^{j) left of Vj] however, we assumed that is at least 5p^^{j) left 

of Vj] hence, p^^{i) > 4p^^{j).) 

Case B: Vj — 5p^^(j) < We have that (see Figure [12]), 

Vj - 5p^%) < 4'“' < «f™ . (4) 

where the inequality on the right holds by Property (P2) of this lemma. Assume that is 

added to the solution when Square handles some request (for some k < i). By Observation 
13.121 If A: > j, then tk > tj, which means that > tj contradicting Inequality 

([3|). Thus k < j. Therefore, by Observation 13.111 ^ [vj — 5 / 9 ^Q(j), “ 1] x [tj — 

5p^^{j),tj]. However, by Inequality ([4|) , G [uj —5/?^Q(j),u®®®^® —1]. Also by Inequality 

([3|), s?l®®® < tj. Hence, 5 ?^®®® < tj — 5 / 9 ^Q(j), implying p^^{i) > 5p^Q(j). Property (PI) 
follows. 

As showed above Property (PI) and Property (P2) hold, the lemma follows too. | 

n 

T-T b 

> 



..serve 

Uj 


Figure 9: r* is on the right of and on the left of rj (case 1, u®®'^''® < Vi < Vj). 
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Figure 10: r* is on the right of rj (case 2, Vj < Vi). 


„close 


4pSQ(j) 


uf^ PST 

Vj - 5pSQ(j) Vj - p®Q(j) 


Figure 11: is on the left of S[rj,5p^'^{j)]. 


,SQt 


3.2.2 Uncovered request has at least two children 

The previous lemma suffices for the case that an uncovered request has only one child. We now 
consider the case where an uncovered request has at least two children. We first establish Lemma 
13.81 (which state in the proof overview) that deals with the case that the quarter ball of request rj is 
later than the tail of some previous request (for some i < j). Before representing the proof of this 
lemma, let us make two “geometric” definitions. Consider two given requests rj and Vi such that 
i > j. Intuitively, (5-BALL®^(f) is later than TAiL(j), if the time of earliest replica of Q-BALL®^(f) 
is not before the time of the last replica of tail(j). Formally, (5-BALL®^(f) is later than tail(j), 
if ti — p^^{i) > tj + 4/9^Q(j). In addition, we say that tail(j) (which contains only replicas of 
.yServe^ is in the range of (5-BALL®^(z) (which contains replicas of the nodes of {vi — p^^{i), ..., Vi}), 
if Vi — p^^{i) < < Vi (in other words, Vi — p^^{i) and there exists a replica of 

in Q-BALL®Q(i)). 
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5pSQ(j) 




.close 


> 5pSQ(j) 


X .„close 
*2 


Figure 12: is below S[rj,5p^^{j)]. 


Before presenting the proof of Lemma 13.81 let us remaind and a bit restate this lemma (using 
formal notations). 

Lemma 13.81 Consider two requests r^ and rj such that j > £. Suppose that, tail(.£) is in the 
range of i.e., Vj — p^^{j) < < Vj. Then, Q-BALL^^{j) is later than tail(£). 

That is, tj — p^^{j) >ti + Ap^^{£). 

Proof of Lemma 13.81 Consider two requests rj and ri that satisfy the conditions of the lemma. 
We begin by showing a slightly weaker assertion, that rj itself is later than tail(£). That is, 
tj > t£ + 4:p^^{i). Assume the contrary, that tj < ti + 4p^^{i). Note that the replicas of TAiL(t') 
of time no later than tj (if such do exists) are “candidates” for the closest and the serving replicas 
of the j’th request (since they belong to the solution SQUARE(j — 1)). Thus, 

p^Q(j) < distZ{{uf'^^^,tj),rj) = Vj - . 

That is, the inequality holds since tj) G SQUARE(j — 1) (see step SQ2); the equality holds 

since rj = {vj,tj) and < Vj. This means that the complete j’th quarter-ball is on the right 

of the £’th serving replica g'®®’’^®, i.e., 

^serve < 

This contradicts the condition of the lemma, hence tj > t£ + 4/?^^ {£) as promised. 

We now prove the lemma’s assertion that tj — p^^{j) > t£ + 4:p^^{i). Denote by g^^®^ the latest 
replica in tail(.£), i.e., g|^®^ = (u®®’^’^®,t£ + 4/9^Q(.^)). Note that g^^®^ is a candidate for the closest 
replica of the j’th request, since g^®"®^ G SQUARE(j — 1) and the time of g^^®^ is earlier than the 
time of Tj (i.e., tj > t£ + Ap^^{£)). Thus, the radius p^^{j) of the j’th request is at most as the 
distance between g^^®^ to rj, see step SQ2. That is. 


P^^U) < distZ{q\^^^ , rj). 

(5) 

In addition, by the condition of the lemma. 


Uj - u^^® < pSQ(j). 

(6) 

Thus, by Inequalities ([5]) and ([6]) 


Vj - < dist^iq],^^^ , rj). 

(7) 
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Recall that, dist^{q\^^^ , rj) = max{tij — , tj — {t^ + 4/9^Q(£))}. Hence, by Ineq. ([7]), 

dist^{q\^^^ , rj) = tj - {t^ + 4 pSQ(^)). 

Combining this with Ineq. ([5]), we get also that p^^{j) < tj — {ti + 4/9^Q(£)). The Lemma follows. 

I 

Now, we are ready to show the main lemma (Lemma 13.71) . which intuitively, shows that “a lot 
of time” (proportional to the request’s radius) passes between the time the one child ball ends and 
the time the next child ball starts. 

We begins by remaining this lemma and restate it a bit (using formal notations). 

Lemma 13.71 Consider some uneovered request ri such that |CHiLDREN(i)| > 2. Let j,k G 
CHiLDREN(i) such that k < j. Then, Q-ball®Q(j) is later than tail(/c). That is, tj — > 

tk + 4p^Q(/c). 

Proof of Lemma 13.7t We consider two cases regarding the relation between the serving replica 
R|erve q£ request and the node Vj of the j’th replica. 

Case 1: < Vj. This is the simpler case. Apply Lemma 13.81 with the requests j and i = k. 

First, note that j > k as required to apply Lemma iTSl To use this lemma, it is also required to 
show that 

V, - p^%) < ur™ < V,. (8) 

The right inequality holds by the assumption of this case. The left inequality holds since 

Vi - p’^%) < Vi < -f ™, 

where the first inequality holds by Lemma 13.141 Property (P2) with i and j; the second inequality 
holds by Lemma (3.141 Property (P2) with i and k. Thus, in this case, the lemma follows by Lemma 

ITSI 


Case 2: Vj < (that is, Vj is on the left of the fc’th serving replica Note that, unlike 

the previous case, tail(A;) is not in the range of Q-BALL®^(j). Thus, the condition of Lemma 13.81 
does not holds, and we cannot apply Lemma 13.81 with j and i = k. Fortunately, we show that in 
this case, we can use another request for which Lemma 13.81 can be applied. That is, we claim that 
in this case, there exists a request that has the following three properties. 

(PI) k<i<r, 

(P2) TAiL(t') is in the range of (5-BALL®^(j) (it satisfies the condition of Lemma 13.81) : and 
(P3) p^^{i) > p^^{k). 

Note that if indeed such a request (that has the above three properties) does exists, then applying 
Lemma 13.81 we will get that 

tj - P^^U) >te + 4pSQ(£) >tk + 4pSQ(fe). 

The last inequality follows from Property (P3) and since tj > ti (since j > i). This will imply the 
lemma. It is left to show that such a request r£ must exist. Let 

REC = [vi,Vj] X [tk - 4:p^^{k),ti], 
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and let 


t = niin Square(/) n REC ^ 0, 

the index of the first request in which the solution Square(£*) contains some replicas in REC. Note 
that REC is well defined since Vi > Vj by Lemma 13.141 Property (P2). We completes the proof by 
showing that i* exists and has properties (P1)-(P3). Hence, we can choose ri = ri* and the lemma 
will follow. 

1. £* has Property (PI), i.e., k < i* < j. 

We first show that Square(A:) does not contain any replica from the rectangle graph rec. 
That is, 


Square(A:) n rec = 0. (9) 

Then, we show that Square(j — 1) does contain some replicas from the rectangle graph rec. 
That is, 


SQUARE(j - 1) n REC 7 ^ 0. (10) 

Once we prove the above two inequalities, they will imply that i* does exist, and in particular, 
k < i* < j as needed. 

Proving Ineq ([9]): Note that when Algorithm SQUARE handles it does not add 
any replica in the above rectangle, since it only adds replicas on the right hand side of 
u|erve^ (Recall that, Vi < Vj and we are now analysing case (2) where Vj < i.e., 

[vi,Vj] C - 1].) 


It is left to prove that Square(/c — 1) does not include a replica in rec. By Observation 
13.111 it follows that Square(A:) and SQUARE(fe — 1) do not contain any replica from the 
“bottom part” of REC, since 

{(u,t) G REC I t < 4} C [vk - - 1] x [4 - 5p^^{k),tk], 

where the inequality holds since Vk — 4:p^^{k) < Vi (by Lemma 13.141 Property (P2)); 
and Vj < (the assumption of case (2)). 

It is left to prove that Square(A: — 1) does not contain any replica from the “top part” 
of REC. 

Assume by the way of contradiction that there exists a replica q = {u, s) G REC O 
Square(/c — 1) such that s > tk- Let n be the request in which Square added q to 
the solution (that is, when Square was handling n, it added q to the solution). The 
assumption that q G Square(A; — 1) implies that such a request r/ does exist, and in 
particular, I < k — 1. Thus, tk > ti, and hence, s > ti . This implies that q is added 
to the solution in step SQ5 and q G tail(/) = t;)) + 4 • /9^^(Z))]. 

Therefore, also, (n®®^^®,4) £ tail(^) (since ti < tk and tk < s <ti+A - p^^{l)), and in 
particular, 

(^serve^^^) ^ SquaRe(A: - 1). 
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In addition, G \vi,Vj\, since q G REG, and also 

Vk - P^^{k) < Vi < < Vj < < Vk , 

where the first and the last inequalities hold by Lemma [3. 141 Property (P2) with i and k; 
the second and the third inequalities hold since G [vi,Vj]; and the fourth inequality 

holds by the assumption of case (2). 

Therefore, in particular, 0 < Vk — < p^^{k). Thus, by Observation 13.131 Vk is a 

covered request, contradicting the assumption that k is child of i (covered requests have 
no parents). Therefore, Square(A: — l)nREC = 0 and (as mentioned) also SQUARE(/c)n 
REG = 0. Hence, Ineq. Q holds. 

Proving Ineq. (|10p : Recall that the j’th closest replica _ ^^close^ ^close^ ^ 

SQUARE(j — 1), see step SQ2. Thus, to show that Ineq. (fTOl) holds, it is sufficient to 
show that g®^®®® G REG. 

The assumption that Q-ball®^(j) and Q-BALL®^(i) are not edge disjoint implies that 
the j’th serving and closest replicas are on the right of r*. That is, 

Vj - P^Q(j) <Vi< u®®'^^® < U®1®®® < Vj < u™, 

where the first and the second inequalities hold by Lemma [3.14l Property (P2); the third 
and the forth inequalities hold by steps SQ2 and SQ3; and the fifth inequality is the 
assumption in the current case (2). 

This implies, in particular, that 

Vj - p^^ij) <Vi< U®^®®® < Vj . (11) 

In addition, by Observation [STOl the radius of an uncovered request is the time difference 
between the request and its closest replica, that is, p^^{j) = tj — gClose^ equivalently 

= tj - /Qo). (12) 

Recall that k and j are children of i, thus Q-BALL®Q(j) and (5-BALL®‘^(/i;) are edges 

disjoint. This, together with inequalities (fTT]i and (fT^ imply that 

> tk . (13) 

Hence, ^ j^y inequalities (flTll and (fOl) and since s^lose < Thus, Ineq. 

(fTOl) holds as promised. 

We have shown that inequalities ([9]) and (fTOjl hold as we argued above this implies that 
has Property (PI). 

2. i* has Property (P2), i.e., tail(^*) is in the range of Q-BALL®Q(j). Recall that i > j\ 
and (5-BALL®Q(i) and Q-BALL®‘^(j) are not edge disjoint, thus by Lemma (3. 141 Part 2, 

Vj - p^'^{j) <Vi <Vj . (14) 
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We show that 


Vi < < Vj , (15) 

which implies together with Ineq. m that Vj — p^^{j) < < Vj as needed (for showing 

that tail(^*) is in the range of Q-ball®‘^(j)). 

It remains to show that Ineq. m holds. Note that, on the one hand, the choice of ri* (as 
the first request which the solution Square(^*) contains a replica in rec) implies that some 
replica q' = {u',t') E REC is added to the solution when Square handles r^*. On the other 
hands, when Algorithm SQUARE handles r^*, it only adds replicas (in steps SQ4 and SQ5) to 
the right of and to the left of v^*. Thus, on the one hand, Vi < u' < Vj, and on the 

other hand, <u'< V£*. Hence, also 

Vi < Vi* and u®F^® < Vj . (16) 


This already establish the right inequality of (fT^ . To show that its left inequality holds too, 
assume toward contradiction that < Vi. Combining this with the left inequality of ()16p , 

we have 


serve 


<Vi< Vi* . 


This implies that, when Algorithm SQUARE handles Vi* , it added the replica {vi,ti*), in step 
SQ4 to the solution. Hence, {vi,ti*) E Square(.£*), and is a candidates for the i’th close 
replica (see step SQ2). Thus, 


< dist^{{vi,ti*,ri)) = ti - ti* . 


Hence, the time of each of Q-BALL®^(i)’s replicas is at least ti*. Recall that, ti* > tk (since 
i* > k)-, and that in each edge e of Q-BALL®Q(i) at least one of e’s endpoints is corresponds 
to time later than Vi — p^^{i). Therefore, (5-BALL®‘^(i) and (5 -ball®Q(A:) are edge disjoint, 
which contradicts the choice of A: as a child of i. Hence, Vi < Ineq. (fT^ holds and i* 

maintains Property (P2) as promised. 

3. £* has Property (P3), i.e., p^^{£*) > p^^{k). 


We first show that the time of the serving replica of the l’*’th request is before 

tk — 5p^^{k). That is, 

sfF^®<4-5pSQ(A;). (17) 

The choice of £* implies that Square(A* — 1) n REC = 0. On the other hand, the serving 
replica g|®^^® does belong to Square(A* — 1) (see step SQ3). This implies, in particular, that 

g,®F^® = (u^f^®,s^f^®) ^ REC = Kvj] X [tk - £>p^^{k),ti] . 

Recall that u®?’^^® E [vi,Vj] by Ineq. (fT^ . hence s®?^^® 0 [tk — 5p^^{k),ti]. Inequality (fT7)l 
holds, since s®*’^^® < ti* < ti. 

Summarizing what we know so far, ti* > tk and sp^^® < tfc — 5p^^{k). Thus, on one hand, 
dist^{qff^^^,ri*) > ti* — s®?’^^® > 5p^^{k). (18) 
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On the other hand, G 5[rf*, 5 • p^Q(£*)] (see step SQ3), which implies that 

> distZ{qp^^^,rt^) • ( 19 ) 

Inequalities (|18lll9p . imply that p^^{£*) > p^^{k) as needed. Hence, i* maintains Property 
(P3). 

We have shown that r^* maintains the three properties, implying the lemma for case (2) too. 
[Lemma 13.7j | 

The previous lemma shows that a lot of time pass between the time of the last replica in the 
quarter ball of a child and the time of first replica in the quarter ball of the next child. The next, 
lemma use this property to show that the radius of a root is at least half of the sum of the radii of 
its children in its tree. 

Lemma 3.15 Consider some root request rj* G ROOTS. Then, 

> E 

igTREE(i*) 

Proof: We begin by showing that the radius of each ball Q-BALL®Q(f) in the tree is at least, 
twice the sum of the radii of its children. Consider some non leaf request G TREE(i*) (that is, 
CHiLDREN(f) 7 ^ 0). Let us first, show that 

pSQ(i)>2 ^ P^%). (20) 

jeCHILDREN(i) 

If CHiLDREN(i) = {j} [i has exactly one child), then ()20p follows from property (PI) of Lemma 
13.141 Otherwise, CHiLDREN(f) = {ji, ji/}) where v >2 and ji < < ... < jv (For simplicity, 

to avoid double subscripts, we may write t{l) instead of ti.) By Lemma [3771 with k = ji and j = j)+i, 
it follows that 


t{ji) + 4/5^^(j/) + P^^{ji+i) < (21) 

for every I = 1,..., n — 1. Now, (see Figure [13]) 

U—1 

p^^{i)>t,-t{h)>4Y,P^^iji), ( 22 ) 

1=1 

where the first inequality holds since the (5 -ball®* 4(^[) and (5‘BALL®*4(jj^) are not edges disjoint; 
the second inequality holds by Inequality (f 2 T]) . since U > t{jv). In addition, by Property (PI) of 
Lemma 13.141 

pSQ(f) > 4p^^{ju), (23) 

which implies Inequality (I20|) that implies the lemma. | 

So far, we have shown that (1) the quarter-ball of the covered requests are edges disjoint; ( 2 ) the 
quarter-ball of the root requests are edges disjoint, and hence by Observation 13.11 and Observation 
[321 the sum of their radii of the covered request and the root requests is no more than 28 times 
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Figure 13: Geometric vision on a parent and its children relationships. 

the cost of OPT. On the other hand, the sum of root’s radii is at least half of the sum of the radii 
of the uncovered requests. This, in fact, establishes Theorem 13.91 

Proof of Theorem 13.91 The ratio for covered request follows Inequality ([T]). For uncovered 
requests it follows from Observation 13.51 and Observation 13.21 that |opt| > XlieROOTS 
Combining this with Lemma [3.151 we have, 2|opt| > X]iGUNCOVER/®^^(*)- Thus, also, 3|opt| > 
Ylneii The Theorem follows from Observation [3T] | 

4 Algorithm D-LlNE°^ - the “real” online algorithm 

In this section, we transform the pseudo online algorithm Square of Section [3] into a (fully) online 
algorithm D-LiNE®*^ for DMCDl|. Let us first give some intuition here. 

The reason Algorithm Square is not online, is one of the the actions it takes at step SQ4. 
There, it stores a copy at the serving replica for request r* from time to time L. This 

requires “going back in time” in the case that the time < tj. A (full) online algorithm cannot 

perform such an action. Intuitively, Algorithm D-Line°^ “simulates” the impossible action by (1) 
storing additional copies (beyond those stored by Square); and (2) shifting the delivery to request 
rt (step SQ4 of Square) from an early time to time ti of rj. It may happen that the serving node 
yServe q£ (£Qgg have a copy (in Square) at ti. In that case. Algorithm D-Line°’^ also (3) 
delivers first a copy to ti) from some node w on the left of Simulation step (1) above 

(that we term the storage phase) is the one responsible for ensuring that such a node w exists, and 
is “not too far” from 

For the storage phase. Algorithm D-Line°^ covers the network by “intervals” of various lengthes 
(pathes that are subgraphs of the network graph). There are overlaps in this cover, so that each 
node is covered by intervals of various lengthes. Let the length of some interval I be length{I). 
Intuitively, given an interval I and a time t, if Square kept a copy in a node of interval I “recently” 

^ We comment that it bears similarities to the transformation of the pseudo online algorithm Triangle to a (full) 
online algorithm for undirected MOD in m- The transformation here is harder, since there the algorithm sometimes 
delivered a copy to a node v from some node on v’s right, which we had to avoid here (since the network is directed 
to the right). 
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(“recent” is proportional to length{I)), then D-Line°'^ makes sure that a copy is kept at the left 
most node of this interval, or “nearby” (in some node in the interval just left to I). 

Now, we (formally) illustrated Algorithm D-Line°^. We begins by giving some definitions. 

Partitions of [l,n] into intervals Consider some positive integer 5 to be chosen later. For 
convenience, we assume that n is a power of S. (It is trivial to generalize it to other values of 
n.) Define log^n + 1 levels of partitions of the interval [l,n]. In level I, partition [l,n] into n/5^ 
intervals, /((5)i, , each of size SK That is, = {{j — 1) ■ d’’ + k \ k = 1 ,..., S’’}, for 

every 1 < j < n/S^ and every 0 < I < log^n. Let I{d) be the set of all such intervals. When it is 
clear from the context, we may omit (5) from Z(5) and I(6)[ and write X and /(, respectively. Let 

£{I) be the level of an interval I el, i.e., f(/j) = 1. For a given interval Jj G I, denote by i^^(/j), 
for 1 < j < n/6^ the neighbor interval of level I that is on the left of ij. That is, = Ij-i- 

Define that = {0}. Let i^(/) = U I. We say that i^(/) is the neighborhood of I. 

Denote by /^(u) (for every node v £ V and every level I = 0, ...,log^n) the interval in level I 
that contains v. That is, /^(u) = /(,, where k = + 1. The neighborhood J^Yv) of a node v 

contains all those nodes in the neighborhood {l\v)) (of the interval of level I of v) that are left 
of V. That is, J^Yv) = {u £ 'l^{P{v)) | tt < u}. 

Active node Consider some node v £V, some level 0 < ^ < log^n. Node v is called {I, 5)-active 
at time t, if (Base U tail) n v[t — S^t] / 0. Intuitively, Algorithm Square kept a movie copy in 
V, at least once, and “not to long” before time t. We say that v is {l,5)-stays-aetive, intuitively, if 
V is not “just about to stop being (/, (5)-active”, that is, if (Base U tail) n v[t — 6^ l,t] / 0. 

Let us now construct Ct+i, the set of replicas corresponding to the nodes that store copies 
from time t to time t + I in a D-LiNE°'^ execution. Let Cq = {(uq = 0,0)}. (The algorithm 
will also leave a copy in uq = 0 always.) To help us later in the analysis, we also added an 
auxiliary set commit C {{I,t) \ I £ Z{5) and t £ N}. Initially, commit £- 0. For each time 
t = 0,1,2,..., consider first the case that there exists at least one request corresponding to time 
t, i.e., Tl[t] = {rj,...,rfc} ^ 0. Then, for each request G Tl[t], D-Line°^ simulates Square to 
find the radius p^^{i) and the serving node of the serving replica = (up’"'^®, ^serve^ q£ 

Vi- Unfortunately, we may not be able to deliver, at time t, a copy from g?®^’^® may be f > s?®’"^®. 
Hence, D-Line°^ delivers a copy to r* via (this is called the “delivery phase”). That is, 

for each i = j ,..., k do: 

(Dl) choose a closest (to {uf^^^^,t)) replica = {uy^,t) on the left of of time t = U 

already in the solution; 

(D2) add the path T-P’^{i) = to the solution. 

Let V’™(f) = {r I {r,q) £ ^°’^(f)}. (Note that rj is served from Ct, after that, the path ^™(j) is 
added; and r^+i is served from Ct U V°’^(j), etc.) 

Recall that before the delivery phase, the replicas of Ct have copies. It is clear, that the delivery 
phase of time t ensures that the replicas of BASE[t] U TAiL[t] have copies too. That is, at the end of 
the delivery phase of time t, at least the replicas of Ct U BASE[t] U TAiL[t] have copies. It is left to 
decide which of the above copies to leave for time t + I. That is (the “storage phase”), D-Line°'^ 
chooses the set Ct+i C CtUBASE[t]UTAlL[t]. Initially, Ct+i £- {(uq, t + l)}U{(u, t + I) | {u,t) £ tail} 
(as we choose to leave copy at the replicas of the tails and to leave a copy at vq always). Then, for 
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each level I = 0 ,log^ n, in an increasing order, the algorithm goes over and each node v = 1, n, 
in an increasing order, selects as follows. 

(SI) Choose a node v such that (1) v is level (/, (5)-stays-active at t; but ( 2 ) no replica has been 
selected in level I u’s neighborhood {Ct+i H N^{v)[t + 1 ] = 0). If such a node v does exist, 
then perform steps (S1.1-S1.3) below. 

(51.1) Add the tuple {P{v),t) to the auxiliary set commit; we say that the interval P{v) commits 
at level I at time t. 

(51.2) Select a node u G i^^(u) such that a replica of u at time t is in BASE[t] UCt (by Observation 
[Q below, such a replica does exist, recall that all these replicas have copies at this time). 

(51.3) Add {u,t + 1 ) to Ct+i and add the arc {u,t + 1 )) to the solution. 

The solution constructed by D-Line°'^ is denoted where = U^^^°’^(i) 

represents the horizontal edges added in the delivery phases and = {((u,t), {v,t + 1 )) I 

{v, t + 1) G Ct+i and t = 0, ...,t]y} represents the arcs added in the storage phase. 

Observation 4.1 (“Well defined”). If a node v £ V is level {I, 5)-stays-active at time t, then 
there exists a replica {u,t) G Ct U BASE[t] U TAiL[t] such that {v,t) G iV*(u). 

Proof: Consider some node v £ V and a time t. If (/, (5)-stays-active at time t, then either 
(u, t) £ Base U tail or (u, t) 0 BASE[t] U TAiL[t] and v is also {1, 5)-stays-active at time t — 1 (and 
Ct n ]^^(u)[t] / 0); hence, (BASE[t] U TAiL[t] UCt) C ]^*(u)[t] / 0. The observation follows. | 
Moreover, a stays-active node v has a copy in its neighborhood longer (for an additional round). 
Observation 4.2 (“A (Z, (5)-active node has a near by copy”). If a node v is {1,5)-active at 
time t, then, either (1) (Base U tail) n i^*(u)[t] 7 ^ 0, or (2) i^*(u)[t] nCf / 0. 

Proof: Consider a node v £ V that is (Z, (5)-active at time t. If (Base U tail) n Z^^(u)[Z] 7 ^ 0, 
then the observation follows. Assume that (BaseU tail) n l^\v)[t] = 0. Then, the fact that v is 
(Z,5)-active at t, but {v,t) 0 BaseU tail, implies also, that v is (Z, 5)-stays-active at time t — 1. 
Thus, either ( 1 ) P{v) commit at Z — 1 (at step (Sl.l)) which “cause” adding an additional replica 
to Ct from l^^v) (at step (SI. 2 )); or ( 2 ) P{v) does not commit at Z — 1 , since Ct has, already, a 
replica from | 

Observation 4.3 (“Bound from above on |^°’^|”). \ P^[(uo,0), (uq,Zv)]! < |commit|. 

Proof: Let \'Pyt[(To, 0), (tq, Zv)]- Now we prove that = |commit|. Every 

arc in (that add at step (SI.3)) corresponds to exactly one tuple (I, Z) of an interval I that 

commits at time Z (in step (Sl.l)); and every interval commits at most once in each time Z that 
corresponds to exactly one additional arc in A-vq- Thus, = |commit|. The observation 

follows. I 

Analysis of D-Line°'^ We, actually, compare the cost of Algorithm D-Line°'^ to that of the 
pseudo online Algorithm Square. The desired competitive ratio for D-Line°^^ will follow, since 
we have shown that Square approximates the optimum (Theorem 13.911 . A similar usage of a (very 

different) pseudo online algorithm utilized in [15]. '^°p*J^^~QyARE ~ ^( ^giogn )- implies 
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the desired competitive ratio of O( io^giog^ ) by Theorem 13.91 We first show, that the number of 
horizontal edges in ( “delivery east”) is O (5 • cost (Square, 7^)). Then, we show, that the the 

number of arcs in {“storage cost”) is O (log^ n • cost(SQUARE, 7^)). Optimizing 6, we get a 

competitiveness of O( iogfogn )- 

Delivery cost analysis. For each request G TZ, the delivery phase (step (D2)) adds = 

to the solution. Define the online radius of r* as = d{q^^,ri). We have, 

N 

(24) 

i=l 

It remains to bound as a function of p^^{i) from above. Restating Observation 14.21 somewhat 
differently we can use the distance Vi — < 5p^^{i) (see (SQ3)) and the time difference 

ti — < 5p^Q(f) for bounding p^^. That is, we show that D-Line°^ has a copy at time U (of 

ri) at a distance at most 105p^Q(f) from (of of Square). Since, u* — < 5p^^{i), 

D-Line°'^ has a copy at distance at most (lOd + 5)p^^{i) from Vi (of ri). 

Lemma 4.4 pf^ < (lOd + 5) • p^^{i). 

Proof: The following claim restating Observation 14.21 somewhat differently and help us to prove 
that the serving replica has a “near by” copy. 

Claim 4.5 Consider some base replica {v,t) G BaseUtail and some p > 0, such that, t + p < t^- 
Then, there exists a replica {w,t + p) G Ct+p such that v — w <25 p. 

Proof: Assume that {v,t) G Basetail. Consider an integer p > 0. Let I = [log^p]. Node v is 
(Z, (I)-active at time t + p. Thus, by Observation 14.21 there exists some node w G I\^{v) that keep 
a copy for time t + p. That is, a replica {w,t + p) G J^\v)[t + p] O Ct+p does exists. The fact that 
w G l^\v) implies that v — w <25K The claim follows, since p > 5^~^. | 

Recall that Square serves request r* = {vi,ti) from some base replica 
already include in the solution. That may correspond to some earlier time. That is, < 

ti- In the case that = ti, D-Line°^ can serve from Hence, p°^ < 5p^Q(z). In 

the more interesting case, < ti. By Claim 14.51 (substituting v = u®®^^®, t = s?®!"^®^ and 

p = ti — s?®^'^® < 5p^Q(f)), there exists a replica {w,ti) G Cj. such that u®®’^'^® — w < 105p^^{i). 

Recall that Vi — < 5p^^{i) (see (SQ3)). Thus, Vi — w < (lOd + 5)p^^{i). Hence, p™ < 

{105+ 5)p^^{i) as well. I 

The following corollary holds, by combining together the above lemma with Inequality (I24p . 

Corollary 4.6 < (lOd + 5) • cost (Square, 7^). 

Analysis of the storage cost By Observation 14.31 it remains to bound the size of | commit| from 
above. Let commR(/, 7) = lif(/,7) G commit (otherwise 0). Hence, |commit| = X^/ex ^)' 

We begin by bounding the number of commitments in D-LiNE°'^ made by nodes for level I = 0. 
Observation liTTl below follows directly from the definitions of commit and stays-active. 

Observation 4.7 Yliexe{i)=o Y)t=o commit {I, t) < 

Proof: Consider some commitment {I, t) G COMMIT, where interval I is of level i{I) = 0. Interval 
I commit at time t only if there exists a node u G / such that v is {I = 0, (5)-stays-active at t 
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(see step (SI) in D-Line°'^). This stays-active status at time t occur only if {v,t) G Base U tail. 
Hence, each base replica causes at most one commitment at t of one interval of level 1 = 0. | 

The following lemma is not really new. The main innovation of the paper is the special pseudo 
online algorithm we developed here. The technique for simulating the pseudo online algorithm by 
a “true” online one, as well as the following analysis of the simulation, are not really new. For 
completeness we still present a (rather detailed) proof sketch for Lemma 14.81 Its more formal 
analysis is deferred to the full paper (and a formal proof of a very similar lemma for very similar 
mapping of undirected MCD) can be found in Lemma 3.8 of |14j . 

Lemma 4.8 |commit| < (1 + 41og^ n)| . 

Proof sketch: The 1 term in the statement of the lemma follows from Observation K7\ for com¬ 
mitments of nodes for level I = 0. The rest of the proof deals with commitments of nodes for level 
I > 0 . 

Let us group the commitments of each such interval (of level I > 0) into “bins”. Later, we shall 
“charge” the commitments in each bin on certain costs of the pseudo online algorithm SQUARE. 
Consider some level I > 0 interval I G X(5) an input TZ. We say that / is a committed-interval if I 
commits at least once in the execution of D-Line°^ on TZ. For each committed-interval I (of level 
£(/) > 0), we define (almost) non-overlapping “sessions” (one session may end at the same time 
the next session starts; hence, two consecutive sessions may overlap on their boundaries). The first 
session of I does not contain any commitments (and is termed an uncommitted-session)] it begins 
at time 0 and ends at the first time that I contains some base replica. Every other session (of I) 
contains at least one commitment (and is termed a committed-session). 

Each commitment (in D-LiNE°'^) of I belongs to some committed session. Denote by pivotal) 
the leftmost node in I, i.e., pivotal) = min{u | v G /}. Given a commitment {I,t) G COMMIT that 
I makes at time t, let us identify (/, t)’s session. Let t~ < t he the last time (before t) there was a 
base replica in pivot{I). Similarly, let t~^ > t he the next time (after t) there will be a base replica 
in pivot{I) (if such a time does exist; otherwise, t~^ = oo). The session of commitment {I,t) starts 
at t~ and ends at t~^. Similarly, when talking about the i’s session of interval I, we say that the 
session starts at t)~(L) and ends at tf{I). When I is clear from the context, we may omit (/)and 
write t~, tf. A bin is a couple {I,i) of a committed-interval and the ith commitment-session of I. 
Clearly, we assigned all the commitments (of level I > 0 intervals) into bins. 

Before proceeding, we claim that the bins indeed do not overlap (except, perhaps, on their 
boundaries). This is because the boundaries of the sessions are times when pivot{I) has a Base 
replicas. At such a times t*, I does not commit. This is because the pivot of / is (Z = 0,5)- 
stays-active at t* and hence keeps a copy. On the other hand, I is of higher level (we are dealing 
with the case of I > 0); hence, it is treated later by the algorithm (see step (SI)). Hence, I indeed 
does not commit at t*. Therefore, there is no overlap between the sessions, except the ending and 
the starting times. That is, <)•••,< > where i' is the number of bins 

that I has. 

Let us now point at costs of algorithm Square on which we “charge” the set of commitments 
COMMIt(I, i) in bin (I, i) for the Zth session of I. We now consider only a bin (I, i) whose committed 
session is not the last. Note that the bin corresponds to a rectangle of |I| by tf—t~ replicas. Expand 
the bin by |/| replicas left, if such exist. This yields the payer of bin (/, i); that is the payer is a 
rectangle subgraph of |i^^(I) U I\ by tf — tu replicas. We point at specific costs Square had in 
this payer. 
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Recall that every non last session of I ends with a base replica in pivot{I), i.e., {pivot{I),tf) G 
Base U TAIL. The solution of Square contains a route (Square route) that starts at the root and 
reaches {pivot{I),tf) by the definition of a base replica. For the charging, we use some (detailed 
below) of the edges in the intersection of that Square route and the payer rectangle. 

The easiest case is that the above Square route enters the payer at the payer’s bottom {t~) and 
stays in the payer until tf. In this case (EB, for Entrance from Below), each time {t~ < t < tf) 
there is a commitment in the bin, there is also an arc at in the Square route (from time t to time 
t + 1). We charge that commitment on that arc Of. The remaining case (SE, for Side Entrance) is 
that the Square route enters the payer from the left side of the payer. (That is. Square delivers 
a copy to pivot{I) from some other node u outside J’s neighborhood, rather than stores copies 
at pivot(iys neighborhood from some earlier time). Therefore, the route must “cross” the left 
neighbor interval of I in that payer. Thus, there exists at least |I| = horizontal edges in the 
intersection between the payer {payer{I,i)), of (/, f) and the Square route. 

Unfortunately, the number of commitments in bin (/, i) can be much grater than However, 
consider some replica {v,t*) G (Base U tail) n where t* is the last time there was a base 
replica in I at its i’th session. The number of commitments in bin (I, i) corresponding to the 
times after t* is at most. (To commit, an interval must have an active node; to be active, 
that node needs a base replica in the last times.) The commitments of times t* to tf are 
charged on the horizontal edges in the intersection between payer{I, i) and Square’s route that 
reach (pivot {I), tf). Recall that, on the one hand, there are commitments at most in bin (/, *) 
corresponding to times t* < t < t~^. On the other hand, there exists at least horizontal edges 
in the intersection between Square route and payer (I). 

We charge the commitments of times t~ to U — 1 on the arcs in the intersection between the 
payer {payer{I, i)), of (I, i) and the Square’s route that reaches [v, t*). (The route of Square that 
reach (u, t*) must contain an arc at = {{u, t), {u, t + 1)) in payer{I, i) for every time t G [t~,t* — 1]; 
this implies that in each time (tf < t < t*) there is a commitment in the bin, there is also an arc 
at in Square solution (from time t to time t + 1); we charge that commitment on that arc at-) 

For each interval I, it is left to account for commitments in I’s last session. That is, we now 
handle the bin (I, i') where I has i' commitment-sessions. This session may not end with a base 
replica in the pivot of I, so we cannot apply the argument above (that Square must have a route 
reaching the pivot of I at tf). On the other hand, the first session of / (the uncommitted-session) 
does end with a base replica in pivot(I), but has no commitments. Intuitively, we use the payer of 
the first session of I to pay for the commitments of the last session of I. Specifically, in the first 
session, the Square route must enter the neighborhood of I from the left side; Hence, we apply 
the argument of case SE above. 

To summarize, ( 1 ) each edge that belongs to Square’s solution may be charged at most once 
to each payer that it belongs too. (2) each edge belongs to 4 log^ n payers at most (there are log^ n 
levels; the payer rectangle of each level is two times wider than the bins; two consecutive sessions 
may intersect only at their boundaries]^. This leads to the term 41og^n before the in the 

statement of the lemma. | 

We now optimize a tradeoff between the storage coast and the delivery cost of D-Line°'^. On 
the one hand, Lemma fd. 81 shows that a large 6 reduces the number of commitments. By Observation 
SSI this means a large 6 reduces the storage cost of D-Line°^. On the other hand, corollary 14.61 

® Note that, unlike the analysis of Line®'’^ for undirected line network [EHIl], we don’t claim that each arc is 
charged just for constant number of times. 
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shows that a small 5 reduces the delivery cost. To optimize the tradeoff (in an order of magnetite), 
~ r iog°fogn l• Thus, log^n = Q( io'giog,t )- Corollary 14.61 Lemma ITSl and Observation 14.31 imply 

that cost(D-LiNE°'^, 7^) = O (^^ Thus, by Theorem 13.91 we have the proof of 
the following theorem. 

Theorem 4.9 Algorithm D-Line°^^ is O ( i^gf^g n )'competitive for DMCD problem. 

5 Optimal algorithm for RSA and for DMCD 

Algorithm D-LiNE®*^ in Section [4] solves DMCD. To solve also RSA, we transform Algorithm 
D-Line®'^ to an algorithm RSA®'^ that solves RSA. First, let us view the reasons why the solution 
for DMCD (Section |4]) does not yet solve RSA. In DMCD, the X coordinate of every request 
(in the set TZ) is taken from a known set of size n (the network nodes {1,2, ...,n}). On the other 
hand, in RSA, the X coordinate of a point is arbitrary. (A lesser obstacle is that the Y coordinate 
is a real number, rather than an integer.) The main idea is to make successive guesses of the 
number of Steinr points and of the largest X coordinate and solve under is proven wrong (e.g. a 
point with a larger X coordinate arrives) then readjust the guess for future request. Fortunately, 
the transformation is exactly the same as the one used in [HUE] to transform the algorithm for 
undirected MCD to solve SRSA. For completeness, we nevertheless present the transformation 
here. 

5.1 Proof Outline 

The following outline is taken (almost) word for word from |15] . (We made minor changes, e.g. 
replacing the word SRSA by the word RSA). 

First, let us view the reasons why the solution for DMCD (Section [3|) does not yet solve RSA. 
In DMCD, the X coordinate of every request (in the set TZ) is taken from a known set of size n (the 
network nodes {1,2, ...,n}). On the other hand, in RSA, the X coordinate of a point is arbitrary. 
(A lesser obstacle is that the Y coordinate is a real number, rather than an integer.) The main 
idea is to make successive guesses of the number of Steinr points and of the largest X coordinate 
and solve under is proven wrong (e.g. a point with a larger X coordinate arrives) then readjust 
the guess for future request. Let us now transform, in three conceptual stages, D-Line°'^ into an 
optimal algorithm for the online problem of RSA: 

1. Given an instance of RSA, assume temporarily (and remove the assumption later) that the 
number N of points is known, as well as M, the maximum X coordinate any request may 
have. Then, simulate a network where n > N and -^logre = 0{y/logN), and the n nodes are 
spaced evenly on the interval between 0 and M. Transform each RSA request to the nearest 
grid point. Solve the resulting DMCD problem. 

2. Translate these results to results of the original RSA instance. 

3. Get rid of the assumptions. 

The first stage is, of course, easy. It turns out that “getting rid of the assumptions” is also relatively 
easy. To simulate the assumption that M is known, guess that M is some Mj. Whenever a guess 
fails, (a request rj = {xi,ti) arrives, where Xi > Mj), continue with an increased guess M^+i. A 
similar trick is used for guessing N. In implementing this idea, our algorithm turned out paying a 
cost of YMj. (This is Mj per failed guess, since each application of Square to a new instance, for a 
new guess, starts with delivering a copy to every node in the simulated network; see the description 
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of Algorithm SQUARE.) On the other hand, an (optimal) algorithm that knew M could have paid 
M only once. IF Mj+i is “sufficiently” larger than Mj, then T,Mj = 0{M). 

The second stage above (translate the results) proved to be somewhat more difficult, even in 
the case that N and M are known (and even if they are equal). Intuitively, following the first 
stage, each request rj = is inside a grid square. The solution of DMCD passes via a corner 

of the grid square. To augment this into a solution of RSA, we need to connect the corner of the 
grid square to r^. This is easy in an offline algorithm. However, an online algorithm is not allowed 
to connect a point at the top of the grid square (representing some time t) to a point somewhere 
inside the grid square (representing some earlier time t — e). 

Somewhat more specifically, following the first stage, each request r* = {xi,ti) is in some 
grid square, where the corners of the square are points of the simulated DMCD problem. If we 
normalize M to be N, then the left bottom left corner of that square is ([xjj, [tij)). Had we 
wanted an offline algorithm, we could have solved an instance of DMCD, where the points are 
([xij, [tij), (L3;2j) L^ 2 j)) (L^aJ) L^sJ)) Then, translating the results of DMCD would have meant 
just augmenting with segments connecting each ([xjj, [t^J) to {xi,ti). Unfortunately, this is not 
possible in an online algorithm, since {xi,ti) is not yet known at ([Uj). Similarly, we cannot use 
the upper left corner of the square (for example) that way, since at time [tj], the algorithm may 
no longer be allowed to add segments reaching the earlier time tj. 

5.2 Informal description of the transformed RSA algorithm assuming n/2 < 
maXj,Q < n and f/n < N < n and n is known 

The algorithm under the assumptions above appears in Figure [T5j Below, let us explain the 
algorithm and its motivation informally. 

When describing the solution of DMCD, it was convenient for us to assume that the network 
node were {!,...,n}. In this section (when dealing with RSA), it is more convenient for us to 
assume that D-Line°'^ solves DMCD with the set of network nodes being {0, ...,n — 1}. Clearly, 
it is trivial (though cumbersome) to change D-Line°'^ to satisfy this assumption. 

Assume we are given a set of points Q = {pi = (xi, yi,..., (xat, j/at))} for RSA. We now translate 
RSA points to DMCD requests (Fig. [TT)l . That is, each point pi = {xi,yi) that is not already 
on a grid node, is located inside some square whose corners are the grid vertices. We move point 
Pi = {xi,yi) to the grid vertex (replica) r* = {vi,ti) on the left top corner of this square. That is, we 
move Pi (if needed) somewhat later in time, and somewhat left on the X axis. We apply D-Line°'^ 
to solve the resulting DMCD. This serves r* = (uj, U) from some other replica {u, ti), where U may 
be slightly later than the time y* we must serve pi. After Square solves the DMCD instance, we 
modify the DMCD solution to move the whole horizontal route RX^{i) of request rj (route from 
g™ = (uf^,ti) to Xi = {vi,ti) somewhat earlier in time (from time U to time yi). This now serves 
a point {vi,yi), where Vi may be slightly left of Xj. Hence, we extend the above horizontal route 
by the segment from {vi,yi) to pi = {xi,yi). In addition, the transformed algorithm leaves extra 
copies in every network node along the route T-L‘^^{i), until time U (see Fig. fTbl dil: a little more 
formally, the algorithm adds to the solution of RSA the vertical line segment Lyer{{k,yi), {k,ti)) 
(a vertical segment between the points {k,yi) and {k,ti)), for every k such that (fe,U) € V°^(i). 

There is a technical point here: D-Line°'^ had a copy in (u, U) and we need a “copy” in {u,yi) 
where U — 1 < y < ti. That is, we need that the solution of RSA problem will already includes 

{u,yi)- 

Observation 5.1 The solution of USA problem already includes {u,yi). 
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Figure 14: Point pi is transformed upward and leftward to 71 ; p 2 is transformed upward and p^ is is 
transformed leftward; the points transform to the same vertex point. 

Proof: To make sure such a copy in {u, yi) does exist, let us consider the way the copy reached 
{u,ti) in D-Line°^. If D-Line°'^ stored a “copy” in u from time t — 1 to t (see Fig. fTHI c) ), then 
also {u, yi) belong to the solution. Otherwise, D-Line°’^ moved the copy to (u, ti) over a route Vp, 
from some other grid vertex 

Note that appeared in the transformed algorithm because that algorithm served a point 

Pi = {xi,yi), of a time ti — 1 < yi < U (see Fig. [TGi dli. The transformed algorithm moved this 
route T’'n[{w,t), {u,t)] earlier in time to 'PH[i'>x,yi), {u,yi)] and left copies in those network node 
until time ti (see Fig. [T 6 l dlb In particular, it leave a copy also in u from time yi to time ti, hence 
(u, yi) is already in the solution of the transform algorithm. | 

So far, we described how to transform the delivery phase of D-Line°'^. The storage phase of 
D-Line°'^ does not need to be transformed. (Actually, DMCD even has some minor extra difficulty 
that does not exist in RSA; consider some request rj_i = (uj_i,ti_i) in DMCD, and suppose that 
the next request rj = {vi,ti) is at time ti = + 10 ; then time t + 1 arrives, and D-Line°^^ must 

make some decisions, without knowing that the next request will be at time ti_i + 10 ; then time t +2 
arrives, etc; no such notion of time passing (without new points arriving) exists in the definition 
of RSA; that is, the Y coordinate y* of the next request pi = {xi,yi) is known right after the 
algorithm finished handling pi = {xi,yi)', the storage phase of the transformed algorithm does not 
make any use of this extra freedom in RSA and simulates the “times”, or the Y coordinates, one 
by one; note that for that purpose, the transformation of the delivery phase ensured the following 
property: that if a copy in DMCD exists in a replica {v,t) in D-Line°^, this replica also contains 
a copy in the transformed algorithm.) Denote the solution of RSA°'^ on Q by For the 

pseudo code, see Fig. [151 

Analysis sketch of the transformed algorithm with known parameters It is not hard to 
see that an optimal solution for that instance of DMCD is “not that far” from an optimal solution 
of the original instance of RSA. To see that, given an optimal solution of RSA, one can derive a 
feasible solution of the resulting DMCD by adding 2 segments of length at most 1 for each point 
p. (One vertical such segment plus a horizontal one are enough to connect a point p to the replica 
{v,t) where we moved p). The total of those distances is 2n at most. On the other hand, an 
optimal solution of RSA would need to pay at least max 2 ;Q > n/2. Hence, an optimal solution for 
DMCD would have implied a constant approximation of RSA. Intuitively, an approximation (and 
a competitive ratio) for DMCD implies an approximation (and a competitive ratio) of RSA in a 
similar way. For a given Algorithm A for RSA and a set Q of input points, let cost{A, Q) be the 
cost of A on Q. Let opt be an optimal algorithm for RSA. 










1. For Pi do: 

(a) compute the translated request ri = of pi; 7^ ■(— {ri}. 

(b) ^ {L^er((0,0), {0,yi)),Lhor{{0,yi), {xi,yi))}; 

2. For each point Pi G Q\ {pi} = {p 2 = ( 2 ^ 2 , 2 / 2 ), ■■■,PN = (2/Af,2/Af)} do: 

(a) compute the translate request = {vi,ti) of pp, 

(b) 7^^7^U{r^}. 

(c) “Vertical phase” 

i. If ti > ti-i, then for each time t = ti-i, — 1 do: 

A. “Simulate” D-Line°^ on TZ to find Ct+i- 

B. + I {v,t + l)GCt+l}. 

(d) “Horizontal phase” 

i. “Simulate” D-Line°^ on TZ to find 

ii. ^ U {Lhor{iuf^,yi), {vi,yi))}. 

hi. ^ U {Lhor{{vi,yi), {xi,yi))} 

iv. ^ U {Lyeriiu, yi), {u, ti)) I (ti, ti) E Pv^(t)}. 

3. Return 

Figure 15: Subroutine RSA°’^ assumes the knowledge of n and that n/2 < maXxQ < n and 
< N < n. 


Lemma 5.2 Assume that maXxQ < n and N < n. Then, cost(RSA°^, Q) = O ( 10 '° ^ (cost (opt, Q)+ 
n)). If also < N and n/2 < maXxQ, then RSA™ is O ( iJ” ) - competitive for RSA. 

Proof: It is easy to verify that RSA®'^ computes a feasible solution (see the “technical point” 
comments in parentheses in sectionConsider some input point set Q = {pi = (xi, t/i), ...,pn = 
(xntVn)} such that max 3 ;Q < n and N < n. Let TZ = {ri = {vi,ti), ...,rN = {vN,tN)} be the 
translated instance of the MCD problem. 

Recall how does RSA°'^(Q) translate the solution of 
D-Line°’^( 7?.). An horizontal edge {{u,ti), (u + l,ti)) E 'P^{i) (that D-Line°'^ add to its solution 
when handling request r*, see step (D2) in D-Line°^) is translated into a horizontal line segment 
Lhor{{u, yi), {u + 1, yi)). An arc {{u, t), (w, t +1)) E (of D-Line^’^’s solution on TZ) is translated 
into a vertical line segment Lxer{{u, t), (uAT^))- Hence, the total cost of those parts of the solution 
of RSA™(Q) is exactly the same as the cost of the solution of D-Line°^(7?.). 

Thus, the cost of RSA®'^ on Q differ from the cost of D-Line°^ on TZ only by two kinds of “short” 
segments (Segment of length at most 1). For the first kind, recall (technical point in Section f5.2p 
that for every moved horizontal path Pnliw,/)), {u,y)], RSA°^ added a short vertical segment for 
every network node w' of that path from {w',y) to {w', [y]). The second kind of addition is an 
horizontal short segment connecting the input point p = {x,y) to {u,y), where u= [xj. 
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The total cost of the second kind is bounded by n, since \vi — Xi\ <\. We claim that the total 
cost of the short segment of the first kind is cost(D-LiNE°'^, 7?.) at most. To see that, notice that 
we have at most 1 such “short” segment (shorter than 1) per replica that appears in the solution of 
D-Line°'^ on TZ. That solution of D-Line°^ contains at least as many edges as it contains replicas. 
Formally, the cost of RSA°'^ is at most, 

cost(RSA™, Q) = 

n 

cos^(D-LINE™,7^) + ^ ■ {ti - yi) + \vi - Xi\) 

i=l 

< 2cost(D-LiNE°^, 7^) + n. 


Thus, by Theorem 14.91 


cost(RSA°'^, 


Q) < Cl 


log n 
log log n 


■ cost(OPT, TV) + n. 


(25) 


where ci is some constant. 

Let us look the other direction, from an optimal solution of RSA for Q to optimal solution of 
DMCD for TZ. Recall that can be served from pi at a cost of 2 (at most). Hence, 


cos7(opt,7^) < cost(OPT, Q) + 2n. 


(26) 


Thus, by Inequalities (f25]l and (f^ . 


cost(RSA°^, Q) 
log n 


< Cl 


log log n 


■ (cost(OPT, Q) + 2n) + n 


= 0{ . • (cost(OPT, Q) + n). 

log log n 


(27) 


The hrst statement of the lemma holds. Now, let us prove the second statement of the lemma. 
Assume that maXa;Q/2 < N <n and -^n < N <n. Thus also. 


cost{oPT, Q) > njl. 


(28) 


Therefore, by Inequalities (1271) and 


cost(RSA°^, Q) 
cost(OPT, Q) 

^ ^ (h -^n + n 


< (5ci + 1) 


COSt{OPT, Q) 
log n 


n/2 


log log n 


The lemma follows, since -^logn < N. | 

Below, RSA™ is used as a module in another algorithm, responsible for implementing the 
assumptions. In each execution of the other algorithm, RSA™ is invoked multiple times, for multiple 
subsets of the input. Unfortunately, not every time, the other algorithm uses RSA™, all the 
assumptions are ensured. This is the reason of the “extra” factor n-^log n in the first part of the 
above lemma above. Fortunately, these extra factors of all the invocations are bounded separately 
later. 
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Figure 16: Example of execution of RSA°‘^. (a) RSA°‘^’s solution after handling point pi-i, (b) RSA®*^ 
simulates the storage phase of D-Line°‘^ on TZ for times t = U-i, — 1; (c) RSA®'^ handles point 
Pi, moves V^{i) from "time” ti to "time” yi (it serves this path from {u,yi), who "receives a copy” 
when D-Line°^ handles time — 1 in the storage phase), and “leaves copies” at the nodes 
from “time” y^ to “time” ti', (d) RSA°'^ handles point pj+i, moves V^{i + l) from “time” ti to “time” 
yi+i (it serves this path from {w,yi+i), who “receives a copy” when handling point pi), and “leaves 
copies” at the nodes of + 1) from “time” yj+i to “time” tj. 
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5.3 Getting rid of the assumption that M=N 

We now describe an online algorithm that is somewhat more general than RSA°'^. Al¬ 

gorithm RSA^^p is not based on the assumption that the upper bound M on max^jQ is also the 
number of points. That is, we now do not assume that M = N. Getting rid of this assumption 
is straightforward. The new online algorithm RSA™^ ^ transforms the X coordinate of each input 
point to the interval [0, re]. Algorithm RSA^^^ passes the transformed point to the online algorithm 
RSA™ of Section [22] that is assumed to be executing in parallel. The transformation of a point is, 
though, a little more involved, as detailed below. 

Later on (in Section 15.411 , RSA^^ ^ will be used by an even more general algorithm in a similar 
way. For that, it is more convenient for us to define algorithm RSA^^^ a somewhat more general 
algorithm then is needed by the description so far. We now assume that the origin is not necessarily 
(0,0), but is rather some po = (0, yo)- (Meanwhile, we still assume that xq = 0). Hence, algorithm 
RSA^^p translates the X coordinate of each input point p = {x,y) to f{x) = x ■ To keep the 

proportion between the axes, the y coordinate y is translated to f{y) = {y — yo) ■ (Recall that 

y > to-) Finally, the solution of RSA°^ is translated back to the coordinates of RSA^^^ applying 
the transformation f~ to every point of the solution. (Clearly, this is a polynomial task, since the 
solution is described using a polynomial number of points). The pseudo code appears in Fig. HTj 
By Lemma 122] and the description of RSA^^^, it is easy to see the following. 

Observation 5.3 Assume that maXxQ < M and N < n. Then, cost(RSA^^p, Q) = Q( io|riogn (cosf(OPT, Q)+ 
M)). If also < N and M/2 < maXxQ, then RSA^^^ is 
Q ( log fofAf ) ~ competitive for RSA. 


• origin is po = (xo,yo)- 

1. Q'^0. 

2. For each point pi G Q do: 

(a) p' = (x',y') ^ f{pi,M,n,yo); 

(b) Q'^Q'u{pa- 

(c) Call RSA™ as a subroutine on Q' to find 

(d) f-HJ^^^HQ'),M,n,yoy, 

Figure 17: Algorithm RSA™^^. 


5.4 Getting rid of the knowledge assumptions 

To give up the assumption that max^jQ is known, we use a standard trick. We first guess that 
maXajQ is “about” twice the X coordinate of the first point. Whenever the guess for max^jQ is 
proven wrong (some pi = {xi,yi) arrives with Xj larger then our guess for max 2 ,Q), we double 
the guess. We do not change the solution for the points we already served. Simply, the points 
that arrive from now on, are treated as a new instance of RSA, to be solved (by RSA™^^) by a 
translation to a new instance of DMCD. Intuitively, every instance of DMCD may need to pay an 
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additional cost that is proportional to our current guess of max^jQ. This is justified by the fact that 
(1) the new guess is at least double our previous guess of max^^Q; and (2) any optimal algorithm 
would need now to pay max^-Q guessed before. (A minor technical point is that the origin of the 
new instance of RSA may not be po = (0,0); instead, the new origin is (0,yi_i), where t/i-i is the 
y-coordinates of the last point served.) 

For justifying the other assumption, that the number of points is known in advance, we use a 
similar trick; however, its justification is more complex. That is, if the number of points grows larger 
beyond our current guess, n-guess, we increase our guess of the number of points. We then start a 
new instance of with the new guess. (In turn, this leads to a new activation of D-Line°'^ 

with n-guess'^^'^ as the new network size.) Hence, we start a new DMCD instance with an increased 
“network size”. The “new” guess n-guess"'^^ of the number of RSA points is (not doubled but) the 
power of 4 of our “current” n-guess (yielding a double exponential sequence). Each new DMCD 
instance is associated with a cost of 0{y/\og n-guess^^'^maxxQ) at most. Thanks to using a double 
exponential groth rather than an exponential growth, this would increase the competitive ratio just 
by a factor of 0(loglogA"). Clearly, one should not increase the guess (of the number of points) 
more than polynomially each time (since otherwise, for the last guess n-guess, the value would 
have been too high compared to the desired competitive ratio.) Summarizing the above 

informal description, given an instance of RSA, we use “guesses” of maXxQ and N to partition the 
points Q into subsets. Each such subset defines a problem we translate separately to DMCD via 

p C A 

Given an instance of RSA, we now define its partition of multiple instances. Eor that, we define 
the partition of Q into subsets Q(l), Q(2), .... The first |Q(1)| points will belong to Q(l), the next 
|Q(2)| will belong to Q{2), etc. We shall also show how to detect online the first point in Q{2), the 
first in Q(3), etc. Before that, we must tackle some technicality. The original RSA problem with 
defined for an origin of A = 0 and T = 0. However, after solving for the RSA instance Q(l), the 
next point is at Y coordinate that is larger than zero. Moreover, when solving DMCD, we allowed 
the origin to be at any node (that is, in any X coordinate). Hence, it is convenient to generalize 
the definition of the RSA to the setting were the input includes an origin point po = {xo,yo), in 
the positive quadrant. The input point set Q{k) includes only points (in the positive quadrant), 
whose y-coordinates are grater than or equal to yo. 

Consider a point set Q = {pi, ...,pjy}. Algorithm RSA°'^ partitions Q into subsets as follows. 
Eor every i = 1,N define that 

M-guess{i) = 2^ , (29) 

where I' = [logmaxlxj | j = l,...,i}], and 

n-guess{i) = 2^ , (30) 

where I* is integer such that I* = min;(2^^* > i). Note that, 2^^*^^ = = (2^^*)^. Hence, the 

growth of the sequence 2^ ,2^ ,2^ ,... is for the power of 4. 

Let us use the above guesses to generate the subset. Specifically, we generate a sequence 
91 < 92 < ••• < 9t (for some gr) of separators between consecutive subsets. That is, Q(l) = 
{pgj, ...,Pg 2 -i}) then Q(2) = {pg^, ...,pg^-i}, etc. A separator is the index of a point where one 
of the guess fails. Specifically, let yi = 1 and if M-guess{gk) < M-guess{N) or n-guess{gk) < 
n-guess{N), then let 

= mini (^M-guess{gk) < M-guess{i) oi n-guess{gk) < n-guess{i)). (31) 
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Define that the guess n-guess of Q{k) is rik = 2^^^" 9“='*®(sfc)) guess M-guess of Q{k) is 

Mk = for every k = 1, ..., r. The origin points of these subsets are defined as follows: 

Let = be the y-axis of the last point in Q{k) and let = 0 and = yf~gl (for 

k = 2, The origin point of Q{k) is Pq = (0, yg), for every k = I, ...,r (see Fig. flBI) . 

All the above functions can be computed online. As sketched, Algorithm RSA°^ handles a 
point after point, and a subset after subset. For every point pi G Q, RSA™ finds the subset 
Q{k) that Pi belongs to (i.e., pi G Q{k)), then RSA°^ passes the point to an instance of RSA^^^ 
executing (in parallel to RSA°^) on Q{k), with the origin point pg = (0,yg), and with the M-guess 
parameter M = and the n-guess parameter n = n^. Denote the solution of RSA^^^ on Q{k) 
by -^M®rfp(2(^))- The solution of RSA®*^ is the union of the solutions of RSA™^ ^ on all the subsets. 

That is, RSA^'^’s solution is ^"^^"^(2) = The pseudo code of RSA°^ is given in 

Fig. [m 



Figure 18: Partitioning Q into subsets Q(l), Q(2),..., Q(5); each instance corresponds to a subset 
Q{k), origin (0, ?/q), M-guess Mk and n-guess nk- 

Theorem 5.4 Algorithm RSA®'^ is optimal and is )-competitive. 

5.5 Optimizing DMCD for a small number of requests 

Algorithm D-LiNE°'^ was optimal only as the function of the network size. Recall that our solution 
for RSA was optimal as a function of the number of requests. We obtain this property for the 
solution of DMCD too, by transforming our RSA algorithm back to solve DMCD, and obtain the 
promised competitiveness, 0(min{ 

Algorithm D-Line°^ was optimal as the function of the network size (Theorem 14.91) . This 
means that it may not be optimal in the case that the number of requests is much smaller than 
the network size. In this section, we use Theorem 15.41 and algorithm RSA°^ to derive an improve 
algorithm for MCD. This algorithm, D-LiNE°'^, is competitive optimal (for DMCD) for any 
number of requests. Intuitively, we benefit from the fact that RSA°^ is optimal for any number of 
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1 . when the first point pi arrives 

(a) A; 1; Q(l) {pi}; Mi ^ ni ■(— 4; g(i = 1; and origin pj = (0,0). 

(b) start an instance of on Q(l); 

2. when an input point arrives pi = {xi, jji) (for i > 1), /* the points are i = 2,N * / 

(a) if Xi < Mk and i < nk, then Q{k) •(— Q{k) U {pi}. 

(b) Otherwise, {pi “open a new instance”), then 

i. A: •(— /c + 1; 

ii. Q{k) ^ {i}; 

iii. Mk ^ 

. ^o‘2(n- Quess(i)) 

IV. 

V. Po = (0,yi-i); 

vi. gk ^ i\ 

vii. start an instance of on Q{k)-, 

3. pass Pi to the instance of RSA^^^ executing on Q{k) with origin Pq; M = Mk; and 
n = nk and compute T^^^p{{pg^, ...,pi+i}). 

4. .FRSA ^^RSAu 

Figure 19: Algorithm RSA°^. 


points (no notion of network size exists in RSA). 

This requires the solution of some delicate point. Given an instance DMCD“ of DMCD, we 
would have liked to just translate the set 7^“ of DMCD requests into a set Q of RSA points and 
apply RSA°^ on them. This may be a bit confusing, since RSA®'^ performs by converting back to 
DMCD. Specifically, recall that RSA®*^ breaks Q into several subsets, and translates back first the 
first subset Qi into an the requests set TZi of a new instance DMCD^ of DMCD. Then, RSA®'^ 
invokes D-Line®*^ on this new instance DMCDJ. The delicate point is that DMCDJ is different 
than DMCDfl. 

In particular, the fact that Qi contains only some of the points of 7?.“, may cause RSA®*^ to 
“stretch” their X coordinates to fit them into the network of DMCD^. Going carefully over the 
manipulations performed by RSA°^ reveals that the solution of RSA°^ may not be a feasible solution 
of DMCD (even though it applied D-Line°^ plus some manipulations). Intuitively, the solution 
of RSA®*^ may “store copies” in places that are not grid vertices in the grid of DMCDq. Thus the 
translation to a solution of DMCDi is not immediate. 

Intuitively, to solve this problem, we translate a solution of RSA°^ to a solution of DMCDq in 
a way that is similar to the way we translated a solution of D-Line°^ to a solution of RSA. That 
is, each request of DMCD^ we move to a “nearby” point of RSA°^. This is rather straightforward, 
given the description of our previous transformation (of Section 15.2p . The details are left for the 
full paper. 
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Theorem 5.5 Algorithm D-Line™ is optimal and it 

\S^})-competitive. 

6 Lower Bound for RSA 

In this section, we prove the following theorem, establishing a tight lower bound for RSA and for 
DMCD on directed line networks. Interestingly, this lower bound is not far from the one proven 
by Alon and Azar for undirected Euclidian Steiner trees [1]. Unfortunately, the lower bound of [1] 
does not apply to our case since their construct uses edges directed in what would be the wrong 
direction in our case (from a high Y value to a low one). 

Theorem 6.1 The competitive ratio of any deterministic online algorithm for DMCD in directed 
line networks is ^( lo^gfpg^ ); implying also an ^( logio^jy ) bound for RSA. 

Proof: We first outline the proof. Informally, given a deterministic online algorithm onalGmcD; we 
construct an adversarial input sequence. Initially, the request set includes the set DIAG = {{k,k) \ 
0 < k < n}. That is, at each time step t, the request {t,t) is made. In addition, if the algorithm 
leaves “many copies” then the lower bound is easy. Otherwise, the algorithm leaves “too few copies” 
from some time t — 1 until time t. For each such time, the adversary makes another request at 
{t — k,t) for some k defined later. The idea is that the adversary can serve this additional request 
from the diagonal copy at (t — k,t — k) paying the cost of k. On the other hand, the algorithm is 
not allowed at time t to decide to serve from {t — k,t — k). It must serve from a copy it did leave. 
Since the algorithm left only “few” copies to serve time t the replica, {t, t — k) can be chosen at 
least at distance A:(logn) from any copy the algorithm did leave. Hence, the algorithm’s cost for 
such a time t is D(logn) times greater than that of the adversary. 

More formally, let 8 = [logn]. Partition the line at time t E {n/2,...,n} into [log^re — Ij 
intervals: Ii{t) = {t — 6^~^^,t — 5*], where i E {1,2,..., [log^n — Ij }. (Note that the intervals are 
well defined, since [log^ u — Ij < [log^ t \, for every n/2 < t < n, which implies that 8^ <t for every 
i = 1,..., [log^n — Ij.) Given an online algorithm onalGmcd, the adversary constructs the set of 
requests TZ as follows. Initially, TZ •(— DiAG. For each time t > nl2, denote by Vpj^Q{t) the set of 
nodes that hold the movie for time t (just before ONALGmcd receives the requests for time t). The 
adversary may add a request at t according to VALcit)- In particular, if onalGmcd leaves a copy 
in at least one of the nodes of every such intervals Ii{t), for i = 1, ..., [log^n — Ij, then the only 
adversary request for time t is {t,t) (while ONALGmcd left copies in at least [log^n — Ij nodes). 
Otherwise, the adversary adds the request (t — <5* , t) to TZ, where i* is an arbitrary index such that 
Ii*{t) n Ualg(^) = 0- That is, the adversary request set of time t is {(t,t)} in the first case and 
{(t — 5* , t), (t, t)} in the second case. 

For each time t = [n/2j,...,n, one of the following two cases hold: (1) ONALGmcd pays at 
least [log^re — Ij = D( ^J°^^g^ ) for storing at least [log^n — Ij copies from time t — 1 to time t, 
while the adversary pays just 2 = 0(1) (to serves request {t,t)); or (2) onalGmcd pays, at least, 
d* — (5® = D((5* ■’■^) for delivering a copy to {t — 8\ , t) from some node outside the interval li* (t), 
while the adversary pays 0{8^*) for storing the movie in node t — <5®* from time t — <5®* to time t 
(that is, serving from replica {t — 8*,t — 8*) on the diagonal) and additional two edges (to serve 
request {t,t)). Thus, in that case, ONALGmcd pays at least O(logn) times more than the adversary. 
This establishes Theorem 16.11 | 
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